REMARKS

Applicants have cancelled Claims 1-3, and 7-10 without prejudice to, or disclaimer of, the 
subject matter contained therein. Applicants maintain that the cancellation of a claim makes no 
admission as to its patentability and reserve the right to pursue the subject matter of the cancelled 
claim in this or any other patent application. 

Applicants have amended Claims 4, 5, 6 and 14 to delete elements (a)-(d). Claims 4 and 
5 are amended to delete the portion of the claim "wherein said isolated nucleic acid encodes a 
polypeptide that is more highly expressed in normal esophagus than in esophageal tumor." 
Claim 14 is amended to include "or a complement thereof" to amended elements (a)-(c), and the
following text: "wherein said isolated nucleic acid molecule is suitable for use as a PCR primer,
or probe; and wherein said isolated nucleic acid is at least about 20 nucleotides in length." Claim
16 is amended to read "at least about 50 nucleotides in length." Claim 17 is amended to depend 
from Claim 4. New Claims 21-31 have been added. 

Applicants submit that no new matter has been added by the amendments, and that 
support for the amendments can be found throughout the specification. Support for the 
amendments to Claim 14 can be found, for example, at paragraphs [0012], [0317], and [0327] of 
the specification. Support for the amendment to Claim 16 and new Claims 21-25 can be found, 
for example, at paragraph [0012]. Support for new Claims 26-31 can be found, for example, in 
the claims as originally filed, and paragraphs [0227] and [0317]. 

Applicants thank the Examiner for the review of the instant application, withdrawing the 
objection to the specification, and deleting the requested inventors. Applicants acknowledge the 
Examiner's partial withdrawal of the rejection of Claims 1-5, 8-10 and 12-20 under 35 U.S.C. §
112, first paragraph, for lack of written description, and the withdrawal of the rejection of Claims
1-13 under 35 U.S.C. § 112, second paragraph, as being indefinite. The rejections of the 
presently pending claims are respectfully traversed. Claims 4-6, 11-14, and 16-31 are presented 
for examination.

Priority

The Examiner maintains that since the claimed invention does not have utility, the 
priority date of the present application is its filing date, May 8, 2002.

Applicants maintain that SEQ ID NOs:85 and 86 were first disclosed in U.S. Provisional
Application 60/100930 filed 9/17/1998 as SEQ ID NOs:1 and 2 and Figures 1, 2A and 2B. The
data in Example 18 (Tumor Versus Normal Differential Tissue Expression Distribution), relied
on in part for the utility of the claimed nucleic acids, were first disclosed in PCT Application
PCT/US00/23328 filed 8/24/2000, on page 93, line 3 through page 96, line 35. As discussed
below in detail, the claimed invention has utility. Thus, Applicants are fully entitled to the 
priority date of at least August 24, 2000. 

Rejection Under 35 U.S.C. § 101 - Utility

The PTO maintains its rejection of Claims 1-14 and 16-20 under 35 U.S.C. § 101 as 
lacking utility for the reasons set forth on pages 3-10 of the previous Office Action. The PTO 
asserts that the data of Example 18 "showing a very small increase in DNA copy number - the 
numerical increase is unknown - in normal esophagus tissues" are insufficient, because "there is 
no evidence regarding whether or not PRO1302 mRNA or polypeptide levels are also increased
in these tissues." Office Action at 4. According to the PTO, "one skilled in the art would not 
assume that a small increase in gene copy number would correlate with significantly increased 
mRNA or polypeptide levels." Office Action at 5. 

The PTO also argues that "[t]he PRO1302 gene has not been associated with tumor
formation or the development of cancer... No mutation or translocation of PRO1302 has been 
associated with esophagus tissue." Office Action at 7. "It is not known whether PRO1302 is 
overexpressed in cancerous esophagus or other cancers, and what the relative levels of 
expression are." Office Action at 7. In addition, "[o]ne cannot determine from the data in the 
specification whether the observed 'amplification' of nucleic acid is due to increase in 
chromosomal copy number, or alternatively due to an increase in transcription rates." Office 
Action at 7. The PTO concludes that in the absence of any of the above information, the 
disclosure is insufficient to meet the utility requirement. 

Applicants respectfully disagree.

Utility - Legal Standard

According to the Utility Examination Guidelines ("Utility Guidelines"), 66 Fed. Reg. 
1092 (2001) an invention complies with the utility requirement of 35 U.S.C. § 101, if it has at 
least one asserted "specific, substantial, and credible utility" or a "well-established utility." 

Under the Utility Guidelines, a utility is "specific" when it is particular to the subject 
matter claimed. For example, it is generally not enough to state that a nucleic acid is useful as a 
diagnostic tool without also identifying the condition that is to be diagnosed. 

The requirement of "substantial utility" defines a "real world" use, and derives from the 
Supreme Court's holding in Brenner v. Manson, 383 U.S. 519, 534 (1966) stating that "The basic 
quid pro quo contemplated by the Constitution and the Congress for granting a patent monopoly 
is the benefit derived by the public from an invention with substantial utility." In explaining the 
"substantial utility" standard, M.P.E.P. § 2107.01 cautions, however, that Office personnel must 
be careful not to interpret the phrase "immediate benefit to the public" or similar formulations 
used in certain court decisions to mean that products or services based on the claimed invention 
must be "currently available" to the public in order to satisfy the utility requirement. "Rather, any 
reasonable use that an applicant has identified for the invention that can be viewed as providing 
a public benefit should be accepted as sufficient, at least with regard to defining a 'substantial' 
utility." (M.P.E.P. § 2107.01, emphasis added). 

The mere consideration that further experimentation might be performed to more fully 
develop the claimed subject matter does not support a finding of lack of utility. M.P.E.P. § 
2107.01 III cites In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995) in stating
that "Usefulness in patent law . . . necessarily includes the expectation of further research and
development. The stage at which an invention in this field becomes useful is well before it is
ready to be administered to humans." Further, "[T]o violate § 101 the claimed device must be
totally incapable of achieving a useful result." Juicy Whip Inc. v. Orange Bang Inc., 51 USPQ2d
1700 (Fed. Cir. 1999), citing Brooktree Corp. v. Advanced Micro Devices, Inc., 977 F.2d 1555, 
1571 (Fed.Cir.1992). 

Indeed, the Guidelines for Examination of Applications for Compliance With the Utility 
Requirement, set forth in M.P.E.P. § 2107 II(B)(1), give the following instruction to patent
examiners: "If the applicant has asserted that the claimed invention is useful for any particular
practical purpose . . . and the assertion would be considered credible by a person of ordinary skill
in the art, do not impose a rejection based on lack of utility." 

Finally, in assessing the credibility of the asserted utility, the M.P.E.P. states that "to 
overcome the presumption of truth that an assertion of utility by the applicant enjoys" the PTO 
must establish that it is "more likely than not that one of ordinary skill in the art would doubt 
(i.e., "question") the truth of the statement of utility." M.P.E.P. § 2107.02 III A. The M.P.E.P.
cautions that: 

Rejections under 35 U.S.C. 101 have been rarely sustained by federal 
courts. Generally speaking, in these rare cases, the 35 U.S.C. 101 rejection was 
sustained either because the applicant ... asserted a utility that could only be 
true if it violated a scientific principle, such as the second law of 
thermodynamics, or a law of nature, or was wholly inconsistent with 
contemporary knowledge in the art. M.P.E.P. § 2107.02 III B. (underline
emphasis in original, bold emphasis added); citing In re Gazave, 379 F.2d 973, 
978, 154 USPQ 92, 96 (CCPA 1967). 

Utility need NOT be Proved to a Statistical Certainty - a Reasonable Correlation between the
Evidence and the Asserted Utility is Sufficient

An Applicant's assertion of utility creates a presumption of utility that will be sufficient to
satisfy the utility requirement of 35 U.S.C. § 101, "unless there is a reason for one skilled in the
art to question the objective truth of the statement of utility or its scope." In re Langer, 503 F.2d
1380, 1391, 183 USPQ 288, 297 (CCPA 1974). See also In re Jolles, 628 F.2d 1322, 206 USPQ
885 (CCPA 1980); In re Irons, 340 F.2d 974, 144 USPQ 351 (1965); In re Sichert, 566 F.2d
1154, 1159, 196 USPQ 209, 212-13 (CCPA 1977). Compliance with 35 U.S.C. § 101 is a
question of fact. Raytheon v. Roper, 724 F.2d 951, 956, 220 USPQ 592, 596 (Fed. Cir. 1983),
cert. denied, 469 U.S. 835 (1984). The evidentiary standard to be used throughout ex parte
examination in setting forth a rejection is a preponderance of the evidence, or "more likely than
not" standard. In re Oetiker, 977 F.2d 1443, 1445, 24 USPQ2d 1443, 1444 (Fed. Cir. 1992).
This is stated explicitly in the M.P.E.P.:

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
statistical certainty. Instead, evidence will be sufficient if, considered as a 
whole, it leads a person of ordinary skill in the art to conclude that the asserted
utility is more likely than not true. M.P.E.P. at § 2107.02, part VII (2004)
(underline emphasis in original, bold emphasis added, internal citations omitted). 

The PTO has the initial burden to offer evidence "that one of ordinary skill in the art 
would reasonably doubt the asserted utility." In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 
1436 (Fed. Cir. 1995). Only then does the burden shift to the Applicant to provide rebuttal 
evidence. Id. As stated in the M.P.E.P., such rebuttal evidence does not need to absolutely prove 
that the asserted utility is real. Rather, the evidence only needs to be reasonably indicative of the 
asserted utility. 

In Fujikawa v. Wattanasin, 93 F.3d 1559, 39 U.S.P.Q.2d 1895 (Fed. Cir. 1996), the Court
of Appeals for the Federal Circuit upheld a PTO decision that in vitro testing of a novel
pharmaceutical compound was sufficient to establish practical utility, stating the following rule:

[T]esting is often required to establish practical utility. But the test results need
not absolutely prove that the compound is pharmacologically active. All that is 
required is that the tests be "reasonably indicative of the desired 
[pharmacological] response." In other words, there must be a sufficient 
correlation between the tests and an asserted pharmacological activity so as to 
convince those skilled in the art, to a reasonable probability, that the novel 
compound will exhibit the asserted pharmacological behavior. Fujikawa v.
Wattanasin, 93 F.3d 1559, 1564, 39 U.S.P.Q.2d 1895 (Fed. Cir. 1996) (internal 
citations omitted, bold emphasis added, italics in original). 

While the Fujikawa case was in the context of utility for pharmaceutical compounds, the 
principles stated by the Court are applicable in the instant case where the asserted utility is for a
therapeutic and diagnostic use - utility does not have to be established to an absolute certainty, 
rather, the evidence must convince a person of skill in the art "to a reasonable probability." In 
addition, the evidence need not be direct, so long as there is a "sufficient correlation" between 
the tests performed and the asserted utility. 

Thus, the legal standard for demonstrating utility is a relatively low hurdle. An Applicant 
need only provide evidence such that it is more likely than not that a person of skill in the art 
would be convinced, to a reasonable probability, that the asserted utility is true. The 
evidence need not be direct evidence, so long as there is a reasonable correlation between the 
evidence and the asserted utility. The Applicant does not need to provide evidence such that it 
establishes an asserted utility as a matter of statistical certainty.

Even assuming that the PTO has met its initial burden to offer evidence that one of
ordinary skill in the art would reasonably doubt the truth of the asserted utility, Applicants assert
that they have met their burden of providing rebuttal evidence such that it is more likely than not
that those skilled in the art, to a reasonable probability, would believe that the claimed invention is
useful as a diagnostic tool for cancer. 

Substantial Utility 

The Data in Example 18 are Data Regarding Differential mRNA Levels, not Gene Amplification 

Applicants begin by clarifying that the data concerning the differential expression of the 
PRO1302 gene presented in Example 18 relate to gene expression, not gene amplification. The
description of Example 18 makes clear that the results were obtained by quantitative PCR 
amplification of cDNA libraries. It is well known in the art that cDNA libraries are made from 
mRNA, and reflect the level of mRNA for a particular gene in the source tissue. Thus, Example 
18 is reporting a measure of the expression of the PRO1302 gene, i.e. mRNA levels, not its 
amplification, i.e. the number of copies of PRO1302 in the genome. 

Throughout the final Office Action, the PTO refers to Example 18 as "amplification 
data." For example, the PTO states "[g]iven the small and indeterminate increase in DNA copy 
number of PRO1302. . .one skilled in the art would not assume that a small increase in gene copy 
number would correlate with significantly increased mRNA or polypeptide levels." Office 
Action at 5 (emphasis added). Similarly, the PTO states that the specification provides "no 
evidence regarding whether or not PRO1302 mRNA or polypeptide levels are also increased."
Office Action at 4 (emphasis added). In rejecting the declaration of Chris Grimaldi, the PTO 
states that "it is important to note that the instant specification provides no information regarding 
decreased protein, DNA or mRNA levels of PRO1302 in tumor samples as contrasted to normal 
tissue samples:.... Only gene amplification data were presented." Office Action at 6-7 (emphasis
added). 

Applicants submit that the PTO has misinterpreted the data presented in this application. 
Clearly, Example 18 reports data regarding differential mRNA levels, not gene amplification 
data. 

The PTO relies on Pennica et al. to teach that "what is often seen is a lack of correlation
between DNA amplification and increased peptide levels." Office Action at 4 (emphasis in
original). Whether or not gene amplification leads to increased gene expression is irrelevant to
this particular application. Likewise, whether the differential mRNA expression of the PRO1302
gene reported in Example 18 is due to an increase or decrease in copy number, or alternatively 
due to an increase or decrease in transcription rates, is simply not relevant. Applicants have 
provided reliable evidence that the PRO1302 mRNA is differentially expressed in certain tumors. 
Whether this differential expression is due to changes in gene copy number, transcription rates, a 
combination of the two, or some other known or unknown cellular mechanism is simply not 
relevant to Applicants' asserted utility. 

Summary of Applicants' Arguments and the PTO's Response

In an attempt to clarify Applicants' argument, Applicants offer a summary of their 
argument and the disputed issues involved. 

1. Applicants assert they have provided reliable evidence that mRNA for the PRO1302 
polypeptide is expressed at least two-fold higher in normal esophagus compared to esophageal 
tumor, and therefore the claimed nucleic acids are useful as diagnostic tools. Applicants are not 
asserting that the claimed nucleic acids will necessarily provide a definitive diagnosis of cancer, 
but rather that they are useful, alone or in combination with other diagnostic tools to assist in the 
diagnosis of esophageal cancer. 

2. Applicants submit that it is not necessary to know what role the PRO1302 gene plays 
in cancer to use its differential expression as a diagnostic tool. 

3. It is not required to prove that the PRO1302 polypeptide is also differentially
expressed in esophageal tumor to establish the utility of the claimed nucleic acids. 

Applicants understand the PTO to be making several arguments in response to 
Applicants' asserted utility: 

1. The PTO has challenged the reliability of the evidence reported in Example 18, and 
states that it provides no information regarding differential mRNA expression; 

2. The PTO cites Pennica et al. to support its position that gene amplification is not
necessarily correlated to gene expression;

3. The PTO cites Hu et al. and Haynes et al. to support its assertion that the literature
cautions against drawing conclusions based on small changes in transcript expression levels 
between normal and cancerous tissue, and that mRNA levels are not predictive of protein levels;

4. The PTO concludes that based on the cited literature, one of skill in the art would not
assume that the increase in gene amplification of PRO1302 would correlate with increased
mRNA or polypeptide levels. Therefore, further research needs to be done to determine if the
increase in PRO1302 DNA supports a role for the peptide in any tissue. (See final Office Action
at 5). 

As detailed below, Applicants submit that the PTO has failed to meet its initial burden to 
offer evidence "that one of ordinary skill in the art would reasonably doubt the asserted utility." 
In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). First, the PTO has failed 
to offer any evidence to support its rejection of the data in Example 18 and the Declaration of 
Chris Grimaldi in support of these data. Second, as discussed above and as can be seen from
Applicants' summary of their argument, Applicants submit that any lack of correlation between
gene amplification and gene expression is not at issue in this application and therefore the
Pennica et al. reference is not relevant. Third, Applicants submit that given the well-established
correlation between a change in the level of mRNA with a corresponding change in the levels of 
the encoded protein, the PRO1302 protein is likely differentially expressed in esophageal tumors. 
However, utility for the pending claims does not rely on whether the encoded polypeptide is 
overexpressed, and as such whether or not increased levels of PRO1302 mRNA correlate with
increased levels of PRO1302 protein is not presently an issue.

Applicants have established that the Gene Encoding the PRO1302 Polypeptide is Differentially 
Expressed in Certain Cancers compared to Normal Tissue 

Applicants first address the PTO's argument that the evidence of differential expression 
of the gene encoding the PRO1302 polypeptide in esophageal tumors is insufficient. Applicants
submit that the gene expression data provided in Example 18 of the present application are
sufficient to establish that the PRO1302 gene is differentially expressed in esophageal tumors.

The gene expression data in the specification, Example 18, show that the mRNA
associated with protein PRO1302 was more highly expressed in normal esophagus compared to
esophageal tumor. Gene expression was analyzed using standard semi-quantitative PCR
amplification reactions of cDNA libraries isolated from different human tumor and normal
human tissue samples. Identification of the differential expression of the PRO1302
polypeptide-encoding gene in tumor tissue compared to the corresponding normal tissue renders
the molecule useful as a diagnostic tool for the determination of the presence or absence of
tumor. Applicants previously submitted as Exhibit A a first Declaration of J. Christopher
Grimaldi, an expert in the field of cancer biology. This declaration explains the importance of
the data in Example 18, and how differential gene and protein expression studies are used to
differentiate between normal and tumor tissue (see Declaration, paragraph 7).

In paragraph 5 of his declaration, Mr. Grimaldi states that the gene expression studies
reported in Example 18 of the instant application were made from pooled samples of normal and
of tumor tissues. Contrary to the PTO's assertions that this makes the data unreliable, Mr.
Grimaldi explains that:

The DNA libraries used in the gene expression studies were made from pooled 
samples of normal and of tumor tissues. Data from pooled samples is more likely 
to be accurate than data obtained from a sample from a single individual. That
is, the detection of variations in gene expression is likely to represent a more 
generally relevant condition when pooled samples from normal tissues are 
compared with pooled samples from tumors in the same tissue type. 
(Paragraph 5) (emphasis added). 

In paragraphs 6 and 7, Mr. Grimaldi explains that the semi-quantitative analysis 
employed to generate the data of Example 18 is sufficient to determine if a gene is over- or 
under-expressed in tumor cells compared to corresponding normal tissue. He states that any 
visually detectable difference seen between two samples is indicative of at least a two-fold 
difference in cDNA between the tumor tissue and the counterpart normal tissue. Thus, the 
results of Example 18 reflect at least a two-fold difference between normal and tumor samples. 
He also states that the results of the gene expression studies indicate that the genes of interest 
"can be used to differentiate tumor from normal," thus establishing their reliability. He explains 
that, contrary to the PTO's assertions, "The precise levels of gene expression are irrelevant; what 
matters is that there is a relative difference in expression between normal tissue and tumor 
tissue." (Paragraph 7). Thus, since it is the relative level of expression between normal tissue 
and suspected cancerous tissue that is important, the precise level of expression in normal tissue 
is irrelevant. Likewise, there is no need for quantitative data to compare the level of expression 
in normal and tumor tissue. As Mr. Grimaldi states, "If a difference is detected, this indicates
that the gene and its corresponding polypeptide and antibodies against the polypeptide are useful
for diagnostic purposes, to screen samples to differentiate between normal and tumor." 

Applicants submit that the declaration of Mr. Grimaldi is based on personal knowledge of 
the relevant facts at issue. Mr. Grimaldi is an expert in the field and conducted or supervised the 
experiments at issue. Applicants remind the PTO that "[o]ffice personnel must accept an opinion 
from a qualified expert that is based upon relevant facts whose accuracy is not being questioned." 
PTO Utility Examination Guidelines (2001) (emphasis added). In addition, declarations relating 
to issues of fact should not be summarily dismissed as "opinions" without an adequate 
explanation of how the declaration fails to rebut the Examiner's position. In re Alton, 76 F.3d
1168 (Fed. Cir. 1996). 

The PTO has not supplied any reasons or evidence to question the accuracy of the facts 
upon which Mr. Grimaldi based his opinion. Mr. Grimaldi has personal knowledge of the 
relevant facts, has based his opinion on those facts, and the PTO has offered no reason or 
evidence to reject either the underlying facts or his opinion. Therefore, the PTO should accept 
Mr. Grimaldi's opinion with regard to his statement that "any visually detectable difference seen
between two samples is indicative of at least a two-fold difference in cDNA between the tumor 
tissue and the counterpart normal tissue" and that the nucleic acids of interest "can be used to 
differentiate tumor from normal." Together, these statements establish that there is at least a 
two-fold difference in expression, and that the results are reliable enough that they can be used to 
distinguish tumor from normal tissue. 

After summarizing the first Grimaldi declaration, the PTO states "[o]ne cannot determine 
from the data in the specification whether the observed 'amplification' of nucleic acid is due to 
increase in chromosomal copy number, or alternatively due to increase in transcription rates," 
and that the specification does not provide any information regarding "decreased protein, DNA, 
or mRNA levels of PRO1302 in tumor samples as contrasted to normal tissue samples: Only
gene amplification data were presented." Office Action at 7. 

As an initial matter, Applicants again point out that the data in Example 18 are gene
expression data, not gene amplification data. The specification and the first Grimaldi 
Declaration make clear that Example 18 used semi-quantitative PCR of cDNA libraries. 
Therefore, one of skill in the art would know that Example 18 is a measure of mRNA levels, and
reflects differential PRO1302 gene expression, not gene amplification. As discussed above, the
PTO cites Pennica et al. for support of the argument that gene amplification does not necessarily
lead to increased gene expression. This reference is irrelevant to the instant application which 
reports differential gene expression, not gene amplification. Therefore, the reference and the 
PTO's arguments regarding gene amplification do not support the PTO's challenge of the 
sufficiency of the Example 18 data, or the first Grimaldi Declaration. 

The PTO also cites Hu et al. (J. Proteome Res., 2(4):405-12 (2003)) for support for its
assertion that the literature cautions researchers against drawing conclusions based on small changes in
transcript expression levels between normal and cancerous tissue. The PTO states that Hu 
teaches that not all genes with increased expression in cancer have a known or published role in 
cancer. 

In Hu, the researchers used an automated literature-mining tool to summarize and 
estimate the relative strengths of all human gene-disease relationships published on Medline. 
They then generated a microarray expression dataset comparing breast cancer and normal breast 
tissue. Using their data-mining tool, they looked for a correlation between the strength of the 
literature association between the gene and breast cancer, and the magnitude of the difference in 
expression level. They report that for genes displaying a 5-fold change or less in tumors 
compared to normal, there was no evidence of a correlation between altered gene expression and 
a known role in the disease. See Hu at 411. However, among genes with a 10-fold or more 
change in expression level, there was a strong correlation between expression level and a 
published role in the disease. Id. at 412. Importantly, Hu reports that the observed correlation 
was only found among estrogen receptor-positive tumors, not less-prevalent ER-negative tumors. 
Id.

The general findings of Hu are not surprising - one would expect that genes with the 
greatest change in expression in a disease would be the first targets of research, and therefore 
have the strongest known relationship to the disease as measured by the number of publications 
reporting a connection with the disease. The correlation reported in Hu only indicates that the 
greater the change in expression level, the more likely it is that there is a published or known role 
for the gene in the disease, as found by their automated literature-mining software. Thus, Hu's 
results merely reflect a bias in the literature toward studying the most prominent targets, and
reflect nothing regarding the ability of a gene that is 2-fold or more differentially expressed in
tumors to serve as a disease marker. Hu acknowledges the shortcomings of this method in 
explaining the disparity in Hu's findings for ER-negative versus ER-positive tumors: Hu 
attributes the "bias in the literature" toward the more prevalent ER-positive tumors as the 
explanation for the lack of any correlation between number of publications and gene expression 
levels in less-prevalent (and, therefore, less studied) ER-negative tumors. Id. Because of this 
intrinsic bias, Hu's methodology is unlikely to ever note a correlation of a disease with less 
differentially-expressed genes and their corresponding proteins, regardless of whether or not an 
actual relationship between the disease and less differentially-expressed genes exists. 
Accordingly, Hu's methodology yields results that provide little or no information regarding 
biological significance of genes with less than 5-fold expression change in disease. 

Applicants submit that a lack of known role for PRO1302 in cancer does not prevent its 
use as a diagnostic tool for cancer. There is a difference between use of a gene for distinguishing 
between tumor and normal tissue on the one hand, and establishing a role for the gene in cancer 
on the other. Genes with lower levels of change in expression may or may not be the most 
important genes in causing the disease, but the genes can still show a consistent and measurable 
change in expression. While such genes may or may not be good targets for further research, 
they can nonetheless be used as diagnostic tools. Thus, Hu does not refute the Applicants' 
assertion that the PRO1302 gene can be used as a cancer diagnostic tool because it is 
differentially expressed in esophageal tumors. 

In addition, whether the differential expression of the PRO1302 gene is due to an increase 
in chromosomal copy number or increased transcription rates is not relevant to whether the 
difference in expression can be used to distinguish tumor from normal tissue. Similarly, the fact 
that there is no known translocation or mutation of PRO1302 is irrelevant to whether its
differential expression can be used to assist in diagnosis of cancer - one does not need to know 
why PRO1302 is differentially expressed, or what the consequence of the differential expression 
is, in order to exploit the differential expression to distinguish tumor from normal tissue. 

The PTO has recognized that the utility of a nucleic acid does not depend on the function 
of the encoded gene product. The Utility Examination Guidelines published on January 5, 2001 
state "In addition, the utility of a claimed DNA does not necessarily depend on the function of
the encoded gene product. A claimed DNA may have a specific and substantial utility because,
e.g. it hybridizes near a disease-associated gene or it has a gene regulating activity." (Federal 
Register, Volume 66, page 1095, Comment 14). While Applicants appreciate that actions taken 
in other applications are not binding on the PTO with respect to the present application, 
Applicants note that the PTO issues patents relating to nucleic acids which are useful for 
diagnosing particular conditions regardless of whether the nucleic acids are the causative agent 
for the condition. For example, polymorphisms which are indicative of a predisposition to a 
particular condition are patentable (see, e.g., U.S. Patent No. 6,465,185, U.S. Patent No. 
6,228,582, and U.S. Patent No. 6,162,604 submitted herewith as Exhibits 1-3), even though they 
may or may not cause the disease itself. Similarly, the present nucleic acids which are useful for 
determining whether an individual has cancer are useful regardless of whether or not they are the 
cause of the cancer. 

The position of the PTO requiring a known role for PRO1302 in cancer for utility is also 
inconsistent with the analogous standard for therapeutic utility of a compound where "the mere 
identification of a pharmacological activity of a compound that is relevant to an asserted 
pharmacological use provides an 'immediate benefit to the public' and thus satisfies the utility 
requirement." M.P.E.P. § 2107.01 (emphasis original). Here, the mere identification of altered 
expression in tumors is relevant to diagnosis of tumors, and, therefore, provides an immediate 
benefit to the public. 

In conclusion, Applicants submit that the evidence reported in Example 18, combined 
with the first Grimaldi Declaration, establishes that there is at least a two-fold difference in 
PRO1302 cDNA between esophageal tumor tissue and normal esophageal tissue. Therefore, it 
follows that expression levels of the PRO1302 gene can be used to distinguish esophageal tumor 
tissue from normal esophageal tissue. The PTO has not offered any significant arguments or 
evidence to the contrary. Applicants have therefore established a utility for the claimed nucleic 
acids as diagnostic tools for cancer, particularly esophageal tumors. 

Applicants have established that the Accepted Understanding in the Art is that there is a Direct 
Correlation between mRNA Levels and the Level of Expression of the Encoded Protein 

Because the claims have been amended such that the claimed nucleic acids are not 
defined by the sequence of the polypeptide they encode, the question of whether there is a 
correlation between changes in gene expression and changes in protein expression is not 
presently at issue. However, Applicants submit that they have established for the record that it is 
well established in the art that a change in the level of mRNA for a particular protein generally 
leads to a corresponding change in the level of the encoded protein. Given Applicants' evidence 
of differential expression of the mRNA for the PRO1302 polypeptide in esophageal tumor, it is 
more likely than not that the PRO1302 polypeptide is also differentially expressed. 

The PTO cites Haynes et al. (Electrophoresis, 19(11): 1862-71 (1998)) as support for the 
assertion that "polypeptide levels cannot be accurately predicted from mRNA levels, and that 
according to their results, the ratio varies from zero to 50-fold." Office Action at 4-5. Because 
the claims have been amended such that the claimed nucleic acids are not defined by the 
sequence of the polypeptide they encode, the question of whether there is a correlation between 
changes in gene expression and changes in protein expression is not presently at issue. Thus, the 
Haynes reference is not relevant. However, even if this issue were important for the utility of the 
claimed nucleic acids, Haynes does not support the PTO's position. 

Haynes studied whether there is a correlation between the level of mRNA expression and 
the level of protein expression for 80 selected genes from yeast. The genes were selected 
because they constituted a relatively homogeneous group with respect to predicted half-life and 
expression level of the protein products. See Haynes at 1863. Haynes did not examine whether a 
change in transcript level for a particular gene led to a change in the level of expression of the 
corresponding protein. Instead, Haynes determined whether the steady-state transcript level 
correlated with the steady-state level of the corresponding protein based on an analysis of 80 
different genes. 

Haynes reported having "found a general trend but no strong correlation between protein 
and transcript levels." Id. However, a cursory inspection of Fig. 1 shows a clear correlation 
between the mRNA levels and protein levels measured. This correlation is confirmed by an 
inspection of the full-length research paper from which the data in Fig. 1 were derived, presented 
herein as Exhibit 4 (Gygi et al., Molecular and Cellular Biology, Mar. 1999, 1720-1730). Gygi 
states that "there was a general trend of increased protein levels resulting from increased mRNA 
levels," with a correlation coefficient of 0.935, indicating a strong correlation. Gygi at 1726. 
Moreover, Gygi also states that the correlation is especially strong for highly expressed mRNAs. 
Id. Thus, it is not clear that Haynes even supports the Examiner's position, as Haynes did report 
a general trend, and Gygi reports a strong correlation between increasing mRNA levels and 
increasing protein levels. 

The PTO focuses on the portion of Haynes where the authors reported that for some of 
the studied genes with equivalent mRNA levels, there were differences in corresponding protein 
expression, including some that varied by more than 50-fold. Similarly, Haynes reports that 
different proteins with similar expression levels were maintained by transcript levels that varied 
by as much as 40-fold. Id. Thus, Haynes showed that for one type of yeast, similar mRNA 
levels for different genes did not universally result in equivalent protein levels for the different 
gene products, and similar protein levels for different gene products did not universally result 
from equivalent mRNA levels for the different genes. These results are expected, since there are 
many factors that determine translation efficiency for a given transcript, or the half-life of the 
encoded protein. Not surprisingly, based on these results, Haynes concluded that protein levels 
cannot always be accurately predicted from the level of the corresponding mRNA transcript when 
looking at the level of transcripts across different genes. 

Importantly, Haynes did not say that for a single gene, the level of mRNA transcript is not 
positively correlated with the level of protein expression. Applicants have asserted that 
increasing or decreasing the level of mRNA for the same gene leads to an increase or decrease in 
the corresponding protein. Haynes did not study this issue and says absolutely nothing about it. 
Therefore, Haynes is not inconsistent with or contradictory to the utility of the instant claims, and 
offers no support for the PTO's position. 

And even if Haynes supported the PTO's argument, which it does not, one contrary 
example does not establish that one of skill in the art would find it is more likely than not that there is 
no general correlation between changes in mRNA level and changes in protein level for an 
individual gene. In fact, the working hypothesis among those skilled in the art, as illustrated by 
the evidence presented by Applicants, is that there is a positive correlation between changes in 
mRNA levels and changes in protein levels for a particular gene. 

In support of the assertion that changes in mRNA are positively correlated to changes in 
protein levels, Applicants previously submitted a copy of a second Declaration by J. Christopher 
Grimaldi, an expert in the field of cancer biology (previously submitted as Exhibit B). As stated 
in paragraph 5 of the declaration, "Those who work in this field are well aware that in the vast 
majority of cases, when a gene is over-expressed... the gene product or polypeptide will also be 
over-expressed.... This same principal applies to gene under-expression." Further, "the 
detection of increased mRNA expression is expected to result in increased polypeptide 
expression, and the detection of decreased mRNA expression is expected to result in decreased 
polypeptide expression. The detection of increased or decreased polypeptide expression can be 
used for cancer diagnosis and treatment." The references cited in the declaration and submitted 
herewith support this statement. 

Applicants also previously submitted a copy of the declaration of Paul Polakis, Ph.D. 
(previously submitted as Exhibit C), an expert in the field of cancer biology. As stated in 
paragraph 6 of his declaration: 

Based on my own experience accumulated in more than 20 years of research, 
including the data discussed in paragraphs 4 and 5 above [showing a positive 
correlation between mRNA levels and encoded protein levels in the vast majority 
of cases] and my knowledge of the relevant scientific literature, it is my 
considered scientific opinion that for human genes, an increased level of mRNA 
in a tumor cell relative to a normal cell typically correlates to a similar increase in 
abundance of the encoded protein in the tumor cell relative to the normal cell. In 
fact, it remains a central dogma in molecular biology that increased mRNA levels 
are predictive of corresponding increased levels of the encoded protein. 
(Emphasis added). 

Dr. Polakis acknowledges that there are published cases where such a correlation does not exist, 
but states that it is his opinion, based on over 20 years of scientific research, that "such reports 
are exceptions to the commonly understood general rule that increased mRNA levels are 
predictive of corresponding increased levels of the encoded protein." (Polakis Declaration, 
paragraph 6). 

The statements of Grimaldi and Polakis are supported by the teachings in Molecular 
Biology of the Cell, a leading textbook in the field (Bruce Alberts, et al., Molecular Biology of 
the Cell (3rd ed. 1994) (submitted herewith as Exhibit 5) and (4th ed. 2002) (submitted herewith 
as Exhibit 6)). Figure 9-2 of Exhibit 6 shows the steps at which eukaryotic gene expression can 
be controlled. The first step depicted is transcriptional control. Exhibit 5 provides that "[f]or 
most genes transcriptional controls are paramount. This makes sense because, of all the possible 
control points illustrated in Figure 9-2, only transcriptional control ensures that no superfluous 
intermediates are synthesized." Exhibit 5 at 403 (emphasis added). In addition, the text states 
that "Although controls on the initiation of gene transcription are the predominant form of 
regulation for most genes, other controls can act later in the pathway from RNA to protein to 
modulate the amount of gene product that is made." Exhibit 5 at 453 (emphasis added). Thus, as 
established in Exhibit 5, the predominant mechanism for regulating the amount of protein 
produced is by regulating transcription initiation. 

In Exhibit 6, Figure 6-3 on page 302 illustrates the basic principle that there is a 
correlation between increased gene expression and increased protein expression. The 
accompanying text states that "a cell can change (or regulate) the expression of each of its genes 
according to the needs of the moment - most obviously by controlling the production of its 
mRNA." Exhibit 6 at 302 (emphasis added). Similarly, Figure 6-90 on page 364 of Exhibit 6 
illustrates the path from gene to protein. The accompanying text states that while potentially 
each step can be regulated by the cell, "the initiation of transcription is the most common point 
for a cell to regulate the expression of each of its genes." Exhibit 6 at 364 (emphasis added). 
This point is repeated on page 379, where the authors state that of all the possible points for 
regulating protein expression, "[f]or most genes transcriptional controls are paramount." Exhibit 
6 at 379 (emphasis added). 

Further support for Applicants' position can be found in the textbook, Genes VI, 
(Benjamin Lewin, Genes VI (1997)) (submitted herewith as Exhibit 7) which states "having 
acknowledged that control of gene expression can occur at multiple stages, and that production of 
RNA cannot inevitably be equated with production of protein, it is clear that the overwhelming 
majority of regulatory events occur at the initiation of transcription." Genes VI at 847-848 
(emphasis added). 

Additional support is found in Zhigang et al., World Journal of Surgical Oncology 
2:13, 2004, submitted herewith as Exhibit 8. Zhigang studied the expression of prostate stem 
cell antigen (PSCA) protein and mRNA to validate it as a potential molecular target for diagnosis 
and treatment of human prostate cancer. The data showed "a high degree of correlation between 
PSCA protein and mRNA expression." Exhibit 8 at 4. Of the samples tested, 81 out of 87 
showed a high degree of correlation between mRNA expression and protein expression. The 
authors conclude that "it is demonstrated that PSCA protein and mRNA overexpressed in human 
prostate cancer, and that the increased protein level of PSCA was resulted from the upregulated 
transcription of its mRNA." Exhibit 8 at 6. Even though the correlation between mRNA 
expression and protein expression occurred in 93% of the samples tested, not 100%, the authors 
state that "PSCA may be a promising molecular marker for the clinical prognosis of human PCa 
and a valuable target for diagnosis and therapy of this tumor." Exhibit 8 at 7. 

Further, Meric et al., Molecular Cancer Therapeutics, vol. 1, 971-979 (2002), submitted 
herewith as Exhibit 9, states the following: 

The fundamental principle of molecular therapeutics in cancer is to exploit the 
differences in gene expression between cancer cells and normal cells... [M]ost 
efforts have concentrated on identifying differences in gene expression at the level 
of mRNA, which can be attributable to either DNA amplification or to differences 
in transcription. Meric et al. at 971 (emphasis added). 

Those of skill in the art would not be focusing on differences in gene expression between cancer 
cells and normal cells if there were no correlation between gene expression and protein 
expression. 

Together, the declarations of Grimaldi and Polakis, the accompanying references, and the 
excerpts and references provided above all establish that the accepted understanding in the art is 
that there is a reasonable correlation between changes in gene expression and the level of the 
encoded protein. 

In response to the second Grimaldi Declaration and the Polakis Declaration, the PTO 
states that "it is important to note that the instant specification provides no information regarding 
decreased mRNA levels of PRO1302.... Only gene amplification data were presented. 
Therefore, the declaration is insufficient to overcome the rejection of claims 1-14 and 16-20... 
since it is limited to a discussion of data regarding the correlation of mRNA levels and 
polypeptide levels." Office Action at 8-9. 

Applicants reiterate that Example 18 provides information regarding differential mRNA 
levels of PRO1302. Thus, the PTO's rejection of the second Grimaldi Declaration and Polakis 
Declaration because they discuss the correlation of mRNA levels and polypeptide levels is 
misplaced. Accordingly, Applicants submit that they have offered sufficient evidence to 
establish that it is more likely than not that one of skill in the art would believe that because the 
PRO1302 mRNA is more highly expressed in normal esophagus compared to esophageal tumor, 
the PRO1302 polypeptide will also be more highly expressed in normal esophagus compared to 
esophageal tumor. 

The References cited by the PTO are Not Sufficient to satisfy the PTO's Initial Burden of 
Offering Evidence "that one of ordinary skill in the art would reasonably doubt the asserted 
utility" 

As stated above, an Applicant's assertion of utility creates a presumption of utility that 
will be sufficient to satisfy the utility requirement of 35 U.S.C. § 101, "unless there is a reason 
for one skilled in the art to question the objective truth of the statement of utility or its scope." In 
re Langer, 503 F.2d 1380, 1391, 183 USPQ 288, 297 (CCPA 1974). The evidentiary standard to 
be used throughout ex parte examination in setting forth a rejection is a preponderance of the 
evidence, or "more likely than not" standard. In re Oetiker, 977 F.2d 1443, 1445, 24 USPQ2d 
1443, 1444 (Fed. Cir. 1992). This is stated explicitly in the M.P.E.P.: 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
statistical certainty. Instead, evidence will be sufficient if, considered as a 
whole, it leads a person of ordinary skill in the art to conclude that the asserted 
utility is more likely than not true. M.P.E.P. at § 2107.02, part VII (2004) 
(underline emphasis in original, bold emphasis added, internal citations omitted). 

The PTO has the initial burden to offer evidence "that one of ordinary skill in the art 
would reasonably doubt the asserted utility." In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 
1436 (Fed. Cir. 1995). Only then does the burden shift to the Applicant to provide rebuttal 
evidence. Id. As stated in the M.P.E.P., such rebuttal evidence does not need to absolutely prove 
that the asserted utility is real. Rather, the evidence only needs to be reasonably indicative of the 
asserted utility. 

Applicants remind the PTO that the M.P.E.P. cautions that rejections for lack of utility 
are rarely sustained by federal courts, and that generally speaking, a utility rejection was 
sustained because the applicant asserted a utility "that could only be true if it violated a 
scientific principle, such as the second law of thermodynamics, or a law of nature, or was 
wholly inconsistent with contemporary knowledge in the art." M.P.E.P. § 2107.02 III B., 
citing In re Gazave, 379 F.2d 973, 978, 154 U.S.P.Q. 92, 96 (CCPA 1967) (underline emphasis 
in original, bold emphasis added). Rather than being wholly inconsistent with contemporary 
knowledge in the art, Applicants' asserted utility is squarely within the teaching of leading 
textbooks in the field, and is supported by references and the declarations of skilled experts. 

There is nothing in the cited references that casts any doubt on the Applicants' asserted 
utility. As stated above, the Pennica et al. reference is not relevant because Example 18 reports 
gene expression data, not gene amplification data. Similarly, Haynes et al. and Hu et al. are also 
not sufficient to satisfy the PTO's burden of offering evidence that a person of skill in the art 
would have a reasonable doubt that a gene differentially expressed in certain tumors can be used 
as a diagnostic tool since neither reference addresses this issue. Given the lack of support for the 
PTO's position, and the supporting evidence provided by the Applicants for their position, one of 
skill in the art would be more likely than not to believe that the claimed nucleic acids can be used 
as diagnostic tools for cancer, particularly esophageal tumors. 

Specific Utility 

The Asserted Substantial Utilities are Specific to the Claimed Nucleic Acids 

Applicants next address the PTO's assertion that the asserted utilities are not specific to 
the claimed nucleic acids related to PRO1302. Applicants respectfully disagree. 

Specific Utility is defined as utility which is "specific to the subject matter claimed," in 
contrast to "a general utility that would be applicable to the broad class of the invention." 
M.P.E.P. § 2107.01 I. Applicants submit that the evidence of differential expression of the 
PRO1302 gene in certain types of tumor cells, along with the declarations and references 
discussed above, provides a specific utility for the claimed nucleic acids. 

As discussed above, there are significant data which show that the gene for the 
PRO1302 polypeptide is expressed at least two-fold higher in normal esophagus compared to 
esophageal tumor tissue. These data are strong evidence that the PRO1302 gene is associated 
with esophageal tumors. Thus, contrary to the assertions of the PTO, Applicants submit that they 
have provided evidence associating the PRO1302 gene with a specific disease. The asserted 
utility as a diagnostic tool for cancer, particularly esophageal tumor, is a specific utility - it is not 
a general utility that would apply to the broad class of nucleic acids. 

Conclusion 

The PTO has asserted the following arguments to support its conclusion that based on the 
cited literature, one of skill in the art would not assume that the increase in gene amplification of 
PRO1302 would correlate with increased mRNA or polypeptide levels: (1) the PTO has 
challenged the reliability of the evidence reported in Example 18; (2) the PTO cites Pennica et al. 
to support its position that gene amplification is not necessarily correlated to gene expression; 
and (3) the PTO cites Hu et al. and Haynes et al. to support its assertion that the literature 
cautions against drawing conclusions based on small changes in transcript expression levels 
between normal and cancerous tissue, and that mRNA levels are not predictive of protein levels. 
The PTO states that further research needs to be done to determine if the increase or decrease in 
PRO1302 DNA supports a role for the peptide in cancerous tissue. Applicants have addressed 
each of these arguments in turn. 

First, the Applicants provided a first Declaration of Chris Grimaldi stating that the gene 
expression data in Example 18 are real and significant. This declaration also indicates that given 
the relative difference of at least two-fold in expression levels, the disclosed nucleic acids have 
utility as cancer diagnostic tools. The PTO has not offered any substantial reason or evidence to 
question the data in Example 18, or the first Grimaldi Declaration. Therefore, Applicants have 
established a basis for the use of the claimed nucleic acids as diagnostic tools, particularly for 
esophageal tumor. Second, Applicants have demonstrated that it is not necessary to know the 
cause or consequence of the differential expression of PRO1302 nucleic acids in certain tumors 
in order to use them as diagnostic tools for cancer. Third, Applicants state that whether the 
encoded polypeptide is also differentially expressed in certain tumors is currently not at issue in 
this application. 






Appl. No. : 10/063,714 

Filed : May 8, 2002 



Applicants have shown that the Pennica et al. reference is irrelevant to the utility of the 
claimed nucleic acids. Applicants have also shown that the Hu et al. and Haynes et al. references 
are not contrary to Applicants' asserted utility. Taken together, these references do not satisfy 
the PTO's burden of offering evidence to prove that one of skill in the art would reasonably 
doubt the asserted utility. 

Finally, the PTO asserts that there is no asserted specific utility. Applicants have pointed 
out that the substantial utilities described above are specific to the claimed nucleic acid because 
the PRO1302 gene and polypeptide are differentially expressed in esophageal tumors compared 
to the corresponding normal esophagus tissue. This is not a general utility that would apply to 
the broad class of nucleic acids. 

Given the totality of the evidence provided, Applicants submit that they have established 
a substantial, specific, and credible utility for the claimed nucleic acids as a diagnostic tool. 
According to the PTO Utility Examination Guidelines (2001), irrefutable proof of a claimed 
utility is not required. Rather, a specific, substantial, and credible utility requires only a 
"reasonable" confirmation of a real world context of use. Applicants remind the PTO that: 

A small degree of utility is sufficient . . . The claimed invention must only be 
capable of performing some beneficial function ... An invention does not lack 
utility merely because the particular embodiment disclosed in the patent lacks 
perfection or performs crudely ... A commercially successful product is not 
required . . . Nor is it essential that the invention accomplish all its intended 
functions ... or operate under all conditions . . . partial success being sufficient to 
demonstrate patentable utility ... In short, the defense of non-utility cannot be 
sustained without proof of total incapacity. If an invention is only partially 
successful in achieving a useful result, a rejection of the claimed invention as a 
whole based on a lack of utility is not appropriate. M.P.E.P. at § 2107.01 (underline 
emphasis in original, bold emphasis added, citations omitted). 

Applicants submit that they have established that it is more likely than not that one of 
skill in the art would reasonably accept the utility for the claimed nucleic acids as diagnostic 
tools as set forth in the specification. In view of the above, Applicants respectfully request that 
the PTO reconsider and withdraw the utility rejection under 35 U.S.C. § 101. 

Rejections under 35 U.S.C. § 112, first paragraph - Enablement 

The PTO also rejects Claims 1-14, and 16-20 under 35 U.S.C. § 112, first paragraph. 
Specifically, the PTO asserts that because the claimed invention is not supported by either a 
specific or substantial asserted utility or a well-established utility, one skilled in the art would not 
know how to use the claimed invention. 

Applicants submit that in the discussion of the 35 U.S.C. § 101 rejection above, 
Applicants have established a substantial, specific, and credible utility for the claimed nucleic 
acids. To the extent that the enablement rejection is based on a lack of utility, Applicants 
respectfully request that the PTO reconsider and withdraw the enablement rejection under 35 
U.S.C. § 112. 

Rejection under 35 U.S.C. § 112, first paragraph - Written Description 

The rejection of Claims 1-5 under 35 U.S.C. § 112, first paragraph, as failing to satisfy 
the written description requirement is maintained for the reasons set forth in the previous Office 
Action. Briefly, the PTO asserts the "Applicants were not in possession of all or a significant 
number of polypeptides that have 80-99% homology to SEQ ID NO:85, while retaining the 
function of SEQ ID NO:85." Office Action at 10. 

The Legal Standard for Written Description 

The well-established test for sufficiency of support under the written description 
requirement of 35 U.S.C. § 112, first paragraph is whether the disclosure "reasonably conveys to 
the artisan that the inventor had possession at that time of the later claimed subject matter." In re 
Kaslow, 707 F.2d 1366, 1375, 217 USPQ 1089, 1096 (Fed. Cir. 1983); see also Vas-Cath, Inc. 
v. Mahurkar, 935 F.2d at 1563, 19 USPQ2d at 1116 (Fed. Cir. 1991). The adequacy of written 
description support is a factual issue and is to be determined on a case-by-case basis. See e.g., 
Vas-Cath, Inc. v. Mahurkar, 935 F.2d at 1563, 19 USPQ2d at 1116 (Fed. Cir. 1991). The factual 
determination in a written description analysis depends on the nature of the invention and the 
amount of knowledge imparted to those skilled in the art by the disclosure. Union Oil v. Atlantic 
Richfield Co., 208 F.3d 989, 996 (Fed. Cir. 2000). 

The Current Invention is Adequately Described 

As noted above, whether the Applicants were in possession of the invention as of the 
effective filing date of an application is a factual determination, reached by the consideration of a 
number of factors, including the level of knowledge and skill in the art, and the teaching 
provided by the specification. The inventor is not required to describe every single detail of 
his/her invention. An Applicant's disclosure obligation varies according to the art to which the 
invention pertains. The present invention pertains to the field of recombinant DNA/protein 
technology. It is well-established that the level of skill in this field is very high since a 
representative person of skill is generally a Ph.D. scientist with several years of experience. 
Accordingly, the teaching imparted in the specification must be evaluated through the eyes of a 
highly skilled artisan as of the date the invention was made. 

The subject matter of the pending claims concerns nucleic acids having 95% or 99% 
sequence identity to the nucleic acid sequence of SEQ ID NO:85, the full-length coding sequence 
of the nucleic acid sequence of SEQ ID NO:85, or the full-length coding sequence of the cDNA 
deposited under ATCC accession number 203230, with the functional recitation as amended: 
"wherein said isolated nucleic acid is more highly expressed in normal esophagus compared to 
esophageal tumor" or "wherein said isolated nucleic acid hybridizes to the complement of a 
nucleic acid of SEQ ID NO:85" under the specified conditions. Other claimed nucleic acids are 
those which hybridize to the nucleic acid sequence of SEQ ID NO:85, the full-length coding 
sequence of the nucleic acid sequence of SEQ ID NO:85, the full-length coding sequence of the 
cDNA deposited under ATCC accession number 203230, or the complements thereof, under the 
specified stringent conditions. We turn first to the claims which recite specific high stringency 
hybridization conditions. 

In Enzo Biochem v. Gen-Probe Inc., 323 F.3d 956 (Fed. Cir. 2002), the Court held that 
functional descriptions of genetic material may satisfy the written description requirement. In so 
holding, the Court gave judicial notice to the USPTO's Manual of Patent Examining Procedure, 
which provides that the written description requirement may be satisfied when the disclosure 
provides sufficiently detailed identifying characteristics, such as "complete or partial structure, 
other physical and/or chemical properties, functional characteristics when coupled with a known 
or disclosed correlation between function and structure, or some combination of such 
characteristics." Id. at 964, quoting 66 Fed. Reg. at 1106 (emphasis in original). In Enzo, the 
Court found describing nucleic acids based on their ability to hybridize to another nucleic acid 
sequence which was adequately described may be an adequate description of the nucleic acid. 
This is because the hybridization function of a nucleic acid is dependent on the sequences of the 
nucleic acid - a disclosed function which is coupled with a known correlation between function 
and structure. The Court favorably discussed the PTO's example wherein "genus claims to 
nucleic acids based on their hybridization properties... may be adequately described if they 
hybridize under highly stringent conditions to known sequences because such conditions dictate 
that all species within the genus will be structurally similar." Id. at 967 (citing Application of 
[Written Description] Guidelines, Example 9) (emphasis added). 

Applicants submit that the stringent hybridization conditions specified in the pending 
claims, alone or in combination with the recited percent sequence identity, result in all species 
within the genus being structurally similar. As the Enzo Court noted, Examples 9 and 10 of the 
Application of Written Description Guidelines (hereinafter "Guidelines") make clear that 
specifying hybridization under highly stringent conditions yields "structurally similar DNAs." 
Guidelines, Example 9 at page 36. The analysis of a genus claim in Example 10 of the 
Guidelines states: 

[T]urning to the genus analysis, the art indicates that there is no substantial 
variation within the [claimed] genus because of the stringency of hybridization 
conditions which yields structurally similar molecules. The single disclosed 
species is representative of the genus because reduction to practice of this species, 
considered along with the defined hybridization conditions and the level of skill 
and knowledge in the art, are sufficient to allow the skilled artisan to recognize 
that applicant was in possession of the necessary common attributes or features of 
the elements possessed by the members of the genus. Guidelines, Example 10 at 
page 39 (emphasis added). 

Given the level of skill in the art, specifying highly stringent conditions leads to "no 
substantial variation within the [claimed] genus," and therefore a skilled artisan would recognize 
that the Applicants were in possession of the necessary common attributes or features of the 
genus. The common element or attribute of the claimed genus is that species of the genus are 
structurally related to SEQ ID NO:85, such that they hybridize to SEQ ID NO:85 or the related 
sequences under the specified high stringency conditions recited in the claims. 

The present situation is not analogous to Fiddes v. Baird, 30 U.S.P.Q. 2d 1481, cited by 
the PTO in the first Office Action. Unlike Fiddes, where arguably the structure of other 
mammalian sequences could not be conceived based on a single species of the genus, here the 
skill in the art is such that the sequence of nucleic acids which hybridize to SEQ ID NO: 85 under 
the conditions specified can be conceived. Here, the claimed genus is defined by its structure - 
members of the genus hybridize under the specified conditions to the specified sequences, each 
of which is adequately described in the specification. 

Applicants submit that the pending claims relating to nucleic acids having 95% or 99% 
sequence identity to the nucleic acids related to SEQ ID NO: 85 with the functional recitation 
"wherein said isolated nucleic acid is more highly expressed in normal esophagus compared to 
esophageal tumor" are also adequately described. In Example 14 of the written description 
training materials, the written description requirement was found to be satisfied for claims 
relating to polypeptides having 95% homology to a particular sequence and possessing a 
particular catalytic activity, even though the applicant had not made any variants. Similarly, the 
nucleic acids of the pending claims have very high sequence homology to the disclosed sequences and must 
share the same expression pattern in esophageal tumors. In Example 14, the procedures for 
making variants were known in the art and the disclosure taught how to test for the claimed 
catalytic activity. Similarly, in the instant application, it is well known in the art how to make 
nucleic acids which have at least 95% sequence identity to the disclosed sequences, and the 
specification discloses how to test to determine if the sequence is differentially expressed in 
esophageal tumors. Like Example 14, the genus of nucleic acids that have at least 95% or 99% 
sequence identity to the disclosed sequences will not have substantial variation since all of the 
variants must have the same expression in certain tumors. 

Furthermore, while Applicants appreciate that actions taken by the PTO in other 
applications are not binding with respect to the examination of the present application, 
Applicants note that the PTO has issued many patents containing claims to variant nucleic acids 
or variant proteins where the applicants did not actually make such nucleic acids or proteins. 
Representative patents include U.S. Patent No. 6,737,522, U.S. Patent No. 6,395,306, U.S. Patent 
No. 6,025,156, U.S. Patent No. 6,645,499, U.S. Patent No. 6,498,235, and U.S. Patent No. 
6,730,502, which are attached hereto as Exhibits 10-15. 

In conclusion, Applicants submit that they have satisfied the written description 
requirement for the pending claims based on the actual reduction to practice of SEQ ID NO:85, 
by specifying the high stringency conditions under which hybridization occurs, and by describing 
the gene expression assay, all of which result in a lack of substantial variability in the species 
falling within the scope of the instant claims. Applicants submit that this disclosure would allow 
one of skill in the art to "recognize that the applicant was in possession of the necessary common 
attributes or features of the elements possessed by the members of the genus." Hence, 
Applicants respectfully request that the PTO reconsider and withdraw the written description 
rejection under 35 U.S.C. §112. 

CONCLUSION 

In view of the above, Applicants respectfully maintain that the claims are patentable and 
request that they be passed to issue. Applicants invite the Examiner to call the undersigned if any 
remaining issues may be resolved by telephone. 

Please charge any additional fees, including any fees for additional extension of time, or 
credit overpayment to Deposit Account No. 11-1410. 

Correlation between Protein and mRNA Abundance in Yeast 

We have determined the relationship between mRNA and protein expression levels for selected genes 
expressed in the yeast Saccharomyces cerevisiae growing at mid-log phase. The proteins contained in total yeast 
cell lysate were separated by high-resolution two-dimensional (2D) gel electrophoresis. Over 150 protein spots 
were excised and identified by capillary liquid chromatography-tandem mass spectrometry (LC-MS/MS). 
Protein spots were quantified by metabolic labeling and scintillation counting. Corresponding mRNA levels 
were calculated from serial analysis of gene expression (SAGE) frequency tables (V. E. Velculescu, L. Zhang, 
W. Zhou, J. Vogelstein, M. A. Basrai, D. E. Bassett, Jr., P. Hieter, B. Vogelstein, and K. W. Kinzler, Cell 
88:243-251, 1997). We found that the correlation between mRNA and protein levels was insufficient to predict 
protein expression levels from quantitative mRNA data. Indeed, for some genes, while the mRNA levels were 
of the same value the protein levels varied by more than 20-fold. Conversely, invariant steady-state levels of 
certain proteins were observed with respective mRNA transcript levels that varied by as much as 30-fold. 
Another interesting observation is that codon bias is not a predictor of either protein or mRNA levels. Our 
results clearly delineate the technical boundaries of current approaches for quantitative analysis of protein 
expression and reveal that simple deduction from mRNA transcript analysis is insufficient. 



The description of the state of a biological system by the 
quantitative measurement of the system constituents is an es- 
sential but largely unexplored area of biology. With recent 
technical advances including the development of differential 
display-PCR (21), of cDNA microarray and DNA chip tech- 
nology (20, 27), and of serial analysis of gene expression 
(SAGE) (34, 35), it is now feasible to establish global and 
quantitative mRNA expression profiles of cells and tissues in 
species for which the sequence of all the genes is known. 
However, there is emerging evidence which suggests that 
mRNA expression patterns are necessary but are by them- 
selves insufficient for the quantitative description of biological 
systems. This evidence includes discoveries of posttranscrip- 
tional mechanisms controlling the protein translation rate (15), 
the half-lives of specific proteins or mRNAs (33), and the 
intracellular location and molecular association of the protein 
products of expressed genes (32). 

Proteome analysis, defined as the analysis of the protein 
complement expressed by a genome (26), has been suggested 
as an approach to the quantitative description of the state of a 
biological system by the quantitative analysis of protein expres- 
sion profiles (36). Proteome analysis is conceptually attractive 
because of its potential to determine properties of biological 
systems that are not apparent by DNA or mRNA sequence 
analysis alone. Such properties include the quantity of protein 
expression, the subcellular location, the state of modification, 
and the association with ligands, as well as the rate of change 
with time of such properties. In contrast to the genomes of a 
number of microorganisms (for a review, see reference 11) and 
the transcriptome of Saccharomyces cerevisiae (35), which have 
been entirely determined, no proteome map has been com- 
pleted to date. 

The most common implementation of proteome analysis is 
the combination of two-dimensional gel electrophoresis (2DE) 
(isoelectric focusing-sodium dodecyl sulfate [SDS]-polyacrylamide 
gel electrophoresis) for the separation and quantitation 
of proteins with analytical methods for their identification. 
2DE permits the separation, visualization, and quantitation of 
thousands of proteins reproducibly on a single gel (18, 24). By 
itself, 2DE is strictly a descriptive technique. The combination 
of 2DE with protein analytical techniques has added the pos- 
sibility of establishing the identities of separated proteins (1, 2) 
and thus, in combination with quantitative mRNA analysis, of 
correlating quantitative protein and mRNA expression mea- 
surements of selected genes. 

The recent introduction of mass spectrometric protein anal- 
ysis techniques has dramatically enhanced the throughput and 
sensitivity of protein identification to a level which now permits 
the large-scale analysis of proteins separated by 2DE. The 
techniques have reached a level of sensitivity that permits the 
identification of essentially any protein that is detectable in the 
gels by conventional protein staining (9, 29). Current protein 
analytical technology is based on the mass spectrometric gen- 
eration of peptide fragment patterns that are idiotypic for the 
sequence of a protein. Protein identity is established by corre- 
lating such fragment patterns with sequence databases (10, 22, 
37). Sophisticated computer software (8) has automated the 
entire process such that proteins are routinely identified with 
no human interpretation of peptide fragment patterns. 

In this study, we have analyzed the mRNA and protein levels 
of a group of genes expressed in exponentially growing cells of 
the yeast S. cerevisiae. Protein expression levels were quantified 
by metabolic labeling of the yeast proteins to a steady state, 
followed by 2DE and liquid scintillation counting of the se- 
lected, separated protein species. Separated proteins were 
identified by in-gel tryptic digestion of spots with subsequent 
analysis by microspray liquid chromatography-tandem mass 
spectrometry (LC-MS/MS) and sequence database searching. 
The corresponding mRNA transcript levels were calculated 
from SAGE frequency tables (35). 

This study, for the first time, explores a quantitative 
comparison of mRNA transcript and protein expression levels for 
a relatively large number of genes expressed in the same 
metabolic state. The resultant correlation is insufficient for 
prediction of protein levels from mRNA transcript levels. We have 
also compared the relative amounts of protein and mRNA 
with the respective codon bias values for the corresponding 
genes. This comparison indicates that codon bias by itself is 
insufficient to accurately predict either the mRNA or the 
protein expression levels of a gene. In addition, the results 
demonstrate that only highly expressed proteins are detectable by 
2DE separation of total cell lysates and that therefore the 
construction of complete proteome maps with current 
technology will be very challenging, irrespective of the type of 
organism. 

FIG. 1. Schematic illustration of proteome analysis by 2DE and mass spectrometry. In part I, proteins are separated by 2DE, stained spots are excised and subjected 
to in-gel digestion with trypsin, and the resulting peptides are separated by on-line capillary high-performance liquid chromatography. In part II, a peptide is shown 
eluting from the column in part I. The peptide is ionized by electrospray ionization and enters the mass spectrometer. The mass of the ionized peptide is detected, and 
the first quadrupole mass filter allows only the specific mass-to-charge ratio of the selected peptide ion to pass into the collision cell. In the collision cell, the energized, 
ionized peptides collide with neutral argon gas molecules. Fragmentation of the peptide is essentially random but occurs mainly at the peptide bonds, resulting in smaller 
peptides of differing lengths (masses). These peptide fragments are detected as a tandem mass (MS/MS) spectrum in the third quadrupole mass filter where two ion 
series are recorded simultaneously, one each from sequencing inward from the N and C termini of the peptide, respectively. In part III, the MS/MS spectrum from the 
selected, ionized peptide is compared to predicted tandem mass spectra computer generated from a sequence database. Provided that the peptide sequence exists in 
the database, the peptide and, by association, the protein from which the peptide was derived can be identified. Unambiguous protein identification is attained in a single 
analysis because multiple peptides are identified as being derived from the same protein. 

MATERIALS AND METHODS 

Yeast strain and growth conditions. The source of protein and message 
transcripts for all experiments was YPH499 (MATa ura3-52 lys2-801 ade2-101 
leu2-Δ1 his3-Δ200 trp1-Δ63) (30). Logarithmically growing cells were obtained by 
growing yeast cells to early log phase (3 × 10^6 cells/ml) in YPD rich medium 
(YPD supplemented with 6 mM uracil, 4.8 mM adenine, and 24 mM tryptophan) 
at 30°C (35). Metabolic labeling of protein was accomplished in YPD medium 
exactly as described elsewhere (4) with the exception that 1 ml of cells was 
labeled with 3 mCi to offset methionine present in YPD medium. Protein was 
harvested as described by Garrels and coworkers (12). Harvested protein was 
lyophilized, resuspended in isoelectric focusing gel rehydration solution, and 
stored at -80°C. 

FIG. 2. 2D silver-stained gel of the proteins in yeast total cell lysate. Proteins were separated in the first dimension (horizontal) by isoelectric focusing and then in 
the second dimension (vertical) by molecular weight sieving. Protein spots (156) were chosen to include the entire range of molecular weights, isoelectric focusing points, 
and staining intensities. Spots were excised, and the corresponding protein was identified by mass spectrometry and database searching. The spots are labeled on the 
gel and correspond to the data presented in Table 1. Molecular weights are given in thousands. 

2DE. Soluble proteins were run in the first dimension by using a commercial 
flatbed electrophoresis system (Multiphor II; Pharmacia Biotech). Immobilized 
polyacrylamide gel (IPG) dry strips with nonlinear pH 3.0 to 10.0 gradients 
(Amersham-Pharmacia Biotech) were used for the first-dimension separation. 
Forty micrograms of protein from whole-cell lysates was mixed with IPG strip 
rehydration buffer (8 M urea, 2% Nonidet P-40, 10 mM dithiothreitol), and 250 
to 380 µl of solution was added to individual lanes of an IPG strip rehydration 
tray (Amersham-Pharmacia Biotech). The strips were allowed to rehydrate at 
room temperature for 1 h. The samples were run at 300 V-10 mA-5 W for 2 h, 
then ramped to 3,500 V-10 mA-5 W over a period of 3 h, and then kept at 3,500 
V-10 mA-5 W for 15 to 19 h. At the end of the first-dimension run (60 to 70 kV·h), 
the IPG strips were reequilibrated for 8 min in 2% (wt/vol) dithiothreitol in 
2% (wt/vol) SDS-6 M urea-30% (wt/vol) glycerol-0.05 M Tris-HCl (pH 6.8) and 
for 4 min in 2.5% iodoacetamide in 2% (wt/vol) SDS-6 M urea-30% (wt/vol) 
glycerol-0.05 M Tris-HCl (pH 6.8). Following reequilibration, the strips were 
transferred and apposed to 10% polyacrylamide second-dimension gels. 
Polyacrylamide gels were poured in a casting stand with 10% acrylamide-2.67% 
piperazine diacrylamide-0.375 M Tris base-HCl (pH 8.8)-0.1% (wt/vol) SDS-0.05% 
(wt/vol) ammonium persulfate-0.05% TEMED (N,N,N',N'-tetramethylethylenediamine) 
in Milli-Q water. The apparatus used to run second-dimension gels 
was a noncommercial apparatus from Oxford Glycosciences, Inc. Once the IPG 
strips were apposed to the second-dimension gels, they were immediately run at 
50 mA (constant)-500 V-85 W for 20 min, followed by 200 mA (constant)-500 
V-85 W until the buffer front line was 10 to 15 mm from the bottom of the gel. 
Gels were removed and silver stained according to the procedure of Shevchenko 
et al. (29). 
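
The first-dimension run is summarized in the text as 60 to 70 kV·h. The following minimal Python sketch checks that figure against the three stated voltage steps. It assumes the ramp from 300 V to 3,500 V is linear, which the text does not state; the function names are illustrative only.

```python
def volt_hours(steps):
    """Sum volt-hours over (start_volts, end_volts, hours) steps.

    A step whose start and end voltages differ is treated as a linear
    ramp (an assumption, not stated in the protocol), so its average
    voltage is the midpoint of the two values.
    """
    return sum((v0 + v1) / 2.0 * hours for v0, v1, hours in steps)


def focusing_program(hold_hours):
    """Voltage program from the text, with a variable final hold time."""
    return [
        (300, 300, 2),             # 300 V for 2 h
        (300, 3500, 3),            # ramp to 3,500 V over 3 h
        (3500, 3500, hold_hours),  # hold at 3,500 V for 15 to 19 h
    ]


low_kvh = volt_hours(focusing_program(15)) / 1000.0
high_kvh = volt_hours(focusing_program(19)) / 1000.0
print(f"{low_kvh:.1f} to {high_kvh:.1f} kV*h")
```

Under the linear-ramp assumption the two bounds bracket the stated 60 to 70 kV·h range closely.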

Protein identification. Gels were exposed to X-ray film overnight, and then the 
silver staining and film were used to excise 156 spots of varying intensities, 
molecular weights, and isoelectric focusing points. In order to increase the 
detection limit by mass spectrometry, spots were cut out and pooled from up to 
four identical cold, silver-stained gels. In-gel tryptic digests of pooled spots were 
performed as described previously (29). Tryptic peptides were analyzed by mi- 
crocapillary LC-MS with automated switching to MS/MS mode for peptide 
fragmentation. Spectra were searched against the composite OWL protein se- 
quence database (version 30.2; 250,514 protein sequences) (24a) by using the 
computer program Sequest (8), which matches theoretical and acquired tandem 
mass spectra. A protein match was determined by comparing the number of 
peptides identified and their respective cross-correlation scores. All protein 
identifications were verified by comparison with theoretical molecular weights 
and isoelectric points. 



mRNA quantitation. Velculescu and coworkers have previously generated 
frequency tables for yeast mRNA transcripts from the same strain grown under 
the same stated conditions as described herein (35). The SAGE technology is 
based on two main principles. First, a short sequence tag (15 bp) that contains 
sufficient information uniquely to identify a transcript is generated. A single tag 
is usually generated from each mRNA transcript in the cell which corresponds to 
15 bp at the 3'-most cutting site for NlaIII. Second, many transcript tags can be 
concatenated into a single molecule and then sequenced, revealing the identity of 
multiple tags simultaneously. Over 20,000 transcripts were sequenced from yeast 
strain YPH499 growing at mid-log phase on glucose. Assuming the previously 
derived estimate of 15,000 mRNA molecules per cell (16), this would represent 
a 1.3-fold coverage even for mRNA molecules present at a single copy per cell 
and would provide a 72% probability of detecting such transcripts. Computer 
software which took for input the gene detected, examined the nucleotide se- 
quence, and performed the calculation as described by Velculescu and coworkers 
(35) was written. In practice, we found that for 21 of 128 (16%) genes examined 
viable mRNA levels from SAGE data could not be calculated. This was because 
(i) no CATG site was found in the open reading frame (ORF), (ii) a CATG site 
was found but the corresponding 10-bp putative SAGE tag was not found in the 
frequency tables, or (iii) identical putative SAGE tags were present for multiple 
genes (e.g., TDH2_YEAST and TDH3_YEAST). 
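
The tag lookup and the coverage estimate described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' software: the function names are hypothetical, the tag is taken as the 10 bases immediately 3' of the last CATG in the supplied sequence, and detection is modeled as a Poisson process, which the text does not specify.

```python
import math

ANCHOR = "CATG"  # NlaIII recognition site
TAG_LEN = 10     # bases taken 3' of the site (the "10-bp putative SAGE tag")


def putative_sage_tag(orf_seq):
    """Return the TAG_LEN bases after the 3'-most CATG, or None if unavailable."""
    seq = orf_seq.upper()
    pos = seq.rfind(ANCHOR)
    if pos == -1:
        return None  # case (i): no CATG site in the sequence
    start = pos + len(ANCHOR)
    tag = seq[start:start + TAG_LEN]
    return tag if len(tag) == TAG_LEN else None


def mrna_lookup(genes, tag_counts):
    """Map each gene to its tag count, or to a string naming why none is usable.

    genes: dict of gene name -> ORF sequence (hypothetical input format).
    tag_counts: dict of tag -> observed count from a frequency table.
    """
    tags = {gene: putative_sage_tag(seq) for gene, seq in genes.items()}
    result = {}
    for gene, tag in tags.items():
        if tag is None:
            result[gene] = "no usable CATG site"
        elif sum(1 for t in tags.values() if t == tag) > 1:
            result[gene] = "tag shared by multiple genes"  # case (iii)
        elif tag not in tag_counts:
            result[gene] = "tag absent from frequency table"  # case (ii)
        else:
            result[gene] = tag_counts[tag]
    return result


def detection_probability(tags_sequenced, mrna_per_cell, copies=1):
    """Poisson chance of seeing at least one tag for a transcript at `copies` per cell."""
    expected_tags = copies * tags_sequenced / mrna_per_cell
    return 1.0 - math.exp(-expected_tags)


print(round(detection_probability(20000, 15000), 3))
```

With 20,000 tags and 15,000 mRNA molecules per cell the Poisson model gives a value in the low 70% range, close to the 72% quoted in the text; the small difference depends on the exact tag total, which is given only as "over 20,000".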

TABLE 1. Expressed genes identified from 2D gel in Fig. 2 

Mol wt      pI       Spot no.  YPD gene  Protein abundance   mRNA abundance  Codon bias
                               name(a)   (10^3 copies/cell)  (copies/cell)

17,259      6.75     133       CPR1      15.2                61.7            0.769
18,702      4.80     83        EGD2      20.1                5.2             0.724
18,726      4.44     147       YKL056C   61.2                88.4            0.831
18,978      5.95     135       YER067W   3.7                 6.7             0.118
19,108      5.04     130       YLR109W   94.4                9.7             0.680
19,681      9.08     136       ATP7      11.0                NA(b,c)         0.246
20,505      6.07     111       GUK1      16.5                3.7             0.422
21,444      5.25     148       SAR1      5.4                 10.4            0.455
21,583      4.98     95        TSA1      110.6               40.1            0.845
22,602      4.30     80        EFB1      66.1                23.8            0.875
23,079      6.29     112       SOD2      12.6                2.2             0.351
23,743      5.44     137       HSP26     NA(d)               0.7             0.434
24,033      5.97     96        ADK1      17.4                16.4            0.656
24,058      4.43     143       YKL117W   29.2                10.4            0.339
24,353      6.30     140       TFS1      8.1                 0.7             0.146
24,662      5.85     99        URA5      25.4                6.0             0.359
24,808      6.33     97        GSP1      26.3                5.2             0.735
24,908      8.73     122       RPS5      18.6                NA(c)           0.899
25,081      4.65     81        MRP8      9.3                 NA(c)           0.241
25,960      6.06     116       RPE1      5.8                 0.7             0.372
26,378      9.55     127       RPS3      96.8                NA(c)           0.863
26,467      5.18     100       VMA4      10.5                3.7             0.427
26,661      5.84     98        TPI1      NA(b)               NA(c)           0.900
27,156      5.56     93        PRE8      6.9                 0.7             0.129
27,334      6.13     115       YHR049W   18.4                2.2             0.520
27,472      5.33     92        YNL010W   31.6                3.7             0.421
27,480      8.95     123       GPM1      10.0                169.4           0.902
27,480      8.95     124       GPM1      231.4               169.4           0.902
27,480      8.95     125       GPM1      7.5                 169.4           0.902
27,809      5.97     139       HOR2      5.7                 0.7             0.381
27,874      4.46     78        YST1      13.6                52.8            0.805
28,595      4.51     41        PUP2      4.4                 0.7             0.147
29,156      6.59     114       YMR226C   14.5                2.2             0.283
29,244      8.40     120       DPM1      5.0                 11.2            0.362
29,443      5.91     48        PRE4      3.4                 3.7             0.162
30,012      6.39     138       PRB1      21.2                1.5             0.449
30,073      4.63     77        BMH1      14.7                28.2            0.454
30,296      7.94     121       OMP2      67.4                41.6            0.499
30,435      6.34     89        GPP1      70.2                11.2            0.703
31,332      5.57     88        ILV6      13.9                3.0             0.402
32,159      5.46     113       IPP1      63.1                3.7             0.752
32,263      6.00     149       HIS1      22.4                4.5             0.232
33,311      5.35     84        SPE3      15.1                6.7             0.468
34,465      5.60     129       ADE1      8.7                 5.2             0.305
34,762      5.32     85        SEC14     10.9                6.0             0.373
34,797      5.85     42        URA1      49.5                8.9             0.237
34,799      6.04     90        BEL1      103.2               81.0            0.875
35,556      5.97     43        YDL124W   6.4                 4.5             0.206
35,619      8.41     59        TDH1      69.8                32.r            0.940
35,650      5.49     68        CAR1      5.2                 3.0             0.339
35,712      6.72     117       TDH2      49.6                473.0(c)        0.982
35,712      6.72     154       TDH2      863.5               473.0(c)        0.982
35,712      6.72     155       TDH2      79.4                473.0(c)        0.982
36,272      4.85     128       APA1      8.7                 0.7             0.425
36,358      5.05     75        YJR105W   17.6                17.1            0.522
36,358      5.05     76        YJR105W   27.5                17.1            0.522
36,596      6.37     79        ADH2      58.9                260.0(c)        0.711
36,714      6.30     102       ADH1      746.1               260.0           0.913
36,714      6.30     103       ADH1      17.6                260.0           0.913
36,714      6.30     104       ADH1      61.4                260.0           0.913
36,714      6.30     105       ADH1      52.7                260.0           0.913
37,033      6.23     44        TAL1      44.8                3.7             0.701
37,796      7.36     57        IDH2      29.4                6.7             0.330
37,886      6.49     106       ILV5      76.0                4.5             0.892
38,700      7.83     55        BAT1      30.9                11.2            0.469
38,702      6.24     46        QCR2      NA(d)               2.2             0.326
39,477      5.58     86        FBA1      17.8                183.6           0.935
39,477      5.58     87        FBA1      427.2               183.6           0.935
39,540      6.50     150       HOM2      60.3                4.5             0.592
39,561      6.12     156       PSA1      96.4                27.5            0.718
41,158      6.01     49        YNL134C   14.9                1.5             0.316
41,623      7.18     58        BAT2      19.0                8.9             0.250
41,728      7.29     110       ERG10     24.1                4.5             0.543
41,900      5.42     74        TOM40     22.3                2.2             0.375
42,402      6.29     45        CYS3      6.7                 8.9             0.621
42,883      5.63     67        DYS1      15.8                5.2             0.526
43,409      6.31     107       SER1      10.5                1.5             0.292
43,421      5.59     91        ERG6      2.2                 14.1            0.408
44,174      7.32     56        YBR025C   13.1                6.0             0.684
44,682      4.99     72        TIF1      2.9                 39.4            0.834
44,707      7.77     108       PGK1      23.7                165.7           0.897
44,707      7.77     109       PGK1      315.2               165.7           0.897
46,080      6.72     30        CAR2      15.4                NA(c)           0.495
46,383      8.52     53        IDP1      7.7                 0.7             0.436
46,553      5.98     47        IDP2      32.4                NA(c)           0.197
46,679      6.39     50        ENO1      35.4                0.7             0.930
46,679      6.39     51        ENO1      6.6                 0.7             0.930
46,679      6.39     52        ENO1      2.2                 0.7             0.930
46,773      5.82     63        ENO2      15.5                289.1           0.960
46,773      5.82     64        ENO2      635.5               289.1           0.960
46,773      5.82     65        ENO2      93.0                289.1           0.960
46,773      5.82     66        ENO2      31.0                289.1           0.960
47,402      6.09     126       COR1      2.5                 0.7             0.422
47,666      8.98     54        AAT2      11.7                6.0             0.338
48,364      5.25     73        WTM1      74.5                13.4            0.365
48,530      6.20     61        MET17     38.1                29.0            0.576
48,904      5.18     69        LYS9      16.2                3.7             0.463
48,987      4.90     153       SUP45     29.6                11.9            0.377
49,727      5.47     70        PRO2      13.6                5.2             0.297
49,912      9.27     62        TEF2      558.5               282.0           0.932
50,444      5.67     35        YDR190C   4.8                 2.2             0.228
50,837      6.11     32        YEL047C   3.8                 1.5             0.387
50,891      4.59     151       TUB2      11.2                7.4             0.404
51,547      6.80     27        LPD1      18.9                2.2             0.351
52,216      7.25     29        SHM2      19.7                7.4             0.722
52,859      5.54     37        YFR044C   30.2                6.7             0.442
53,798      5.19     71        HXK2      26.5                7.4             0.756
53,803      6.05     145       GYP6      4.4                 0.7             0.147
54,403      5.29     39        ALD6      37.7                2.2             0.664
54,403      5.29     40        ALD6      6.6                 2.2             0.664
54,502      6.20     31        ADE13     6.3                 1.5             0.417
54,543      7.75     25        PYK1      225.3               101.8           0.965
54,543      7.75     26        PYK1      39.8                101.8           0.965
55,221      6.66     146       YEL071W   16.3                3.0             0.244
55,295      4.35     134       PDI1      66.2                14.1            0.589
55,364      5.98     24        GLK1      22.6                6.0             0.237
55,481      7.97     118       ATP1      21.6                2.2             0.637
55,886      6.47     28        CYS4      22.2                NA(c)           0.444
56,167      5.83     33        ARO8      14.3                3.0             0.324
56,167      5.83     34        ARO8      9.1                 3.0             0.324
56,584      6.36     20        CYB2      18.9                NA(c)           0.259
57,366      5.53     60        FRS2      2.3                 0.7             0.451
57,383      5.98     144       ZWF1      5.6                 0.7             0.215
57,464      5.49     36        THR4      21.4                3.7             0.508
57,512      5.50     7         SRV2      6.5                 NA(c)           0.260
57,727      4.92     152       VMA2      33.7                8.9             0.546
58,573      6.47     17        ACH1      4.4                 1.5             0.327
58,573      6.47     18        ACH1      5.4                 1.5             0.327
61,353      5.87     21        PDC1      6.5                 200.7           0.962
61,353      5.87     22        PDC1      303.2               200.7           0.962
61,353      5.87     23        PDC1      16.3                200.7           0.962
61,649      5.54     38        CCT8      2.2                 1.5             0.271
61,902      6.21     101       PDC5      4.3                 NA(c)           0.828
62,266      6.19     16        ICL1      20.1                NA(c)           0.327
62,862      8.02     19        ILV3      5.3                 4.5             0.548
63,082      6.40     119       PGM2      2.2                 3.0             0.402
64,335      5.77     5         PAB1      30.4                1.5             0.616
66,120      5.42     8         STI1      6.7                 0.7             0.313
66,120      5.42     9         STI1      6.4                 0.7             0.313
66,450      5.29     141       SSB2      7.0                 NA(c)           0.880
66,450      5.29     142       SSB2      2.3                 NA(c)           0.880
66,456      5.23     10        SSB1      64.5                79.5            0.907
66,456      5.23     11        SSB1      59.0                79.5            0.907
66,456      5.23     12        SSB1      13.7                79.5            0.907
68,397      5.82     82        LEU4      3.1                 3.0             0.407
69,313      4.90     13        SSA2      24.3                18.6            0.892
69,313      4.90     14        SSA2      77.1                18.6            0.892
74,378      8.46     15        YKL029C   2.8                 3.7             0.353
75,396      5.82     6         GRS1      5.5                 7.4             0.500
85,720      6.25     1         MET6      2.0                 NA(c)           0.772
85,720      6.25     2         MET6      10.9                NA(c)           0.772
85,720      6.25     3         MET6      1.4                 NA(c)           0.772
93,276      6.11     131       EFT1      17.9                41.6            0.890
93,276      6.11     132       EFT1      5.7                 41.6            0.890
102,064(e)  6.61(e)  94        ADE3      4.8                 5.2             0.423
107,482(e)  5.33(e)  4         MCM3      2.7                 NA(c)           0.240

(a) YPD gene names are available from the YPD website (39). 
(b) NA, calculation could not be performed or was not available. 
(c) mRNA data inconclusive or NA. 
(d) No methionines in predicted ORF; therefore, protein concentration was not determined. 
(e) Measured molecular weight or pI did not match theoretical molecular weight or pI. 
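
The protein and mRNA columns of Table 1 lend themselves to a direct correlation check. The Python sketch below computes Pearson's r for a small, hand-picked subset of single-spot genes copied from the table. It is only an illustration: the subset is arbitrary, Pearson's r is assumed here as one common statistic, and this section does not state which statistic or which gene set the authors used.

```python
import math

# gene -> (protein abundance in 10^3 copies/cell, mRNA abundance in copies/cell)
# Illustrative subset of single-spot genes copied from Table 1.
TABLE1_SUBSET = {
    "CPR1": (15.2, 61.7),
    "EGD2": (20.1, 5.2),
    "YKL056C": (61.2, 88.4),
    "TSA1": (110.6, 40.1),
    "EFB1": (66.1, 23.8),
    "SOD2": (12.6, 2.2),
    "ADK1": (17.4, 16.4),
    "TEF2": (558.5, 282.0),
}


def pearson(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)


protein = [p for p, _ in TABLE1_SUBSET.values()]
mrna = [m for _, m in TABLE1_SUBSET.values()]

# Linear-scale r is dominated by the few most abundant genes;
# log-scale r weights low- and high-abundance genes more evenly.
r_linear = pearson(mrna, protein)
r_log = pearson([math.log10(m) for m in mrna], [math.log10(p) for p in protein])
print(f"r (linear) = {r_linear:.2f}, r (log10) = {r_log:.2f}")
```

Comparing the linear and log-scale values shows how strongly a handful of highly expressed genes can drive an apparent correlation.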



Protein quantitation. [35S]methionine-labeled gels were exposed to X-ray film 
overnight, and then the silver stain and film were used to excise 156 spots of 
varying intensities, molecular weights, and pIs. The excised spots were placed in 
0.6-ml microcentrifuge tubes, and scintillation cocktail (100 µl) was added. The 
samples were vortexed and counted. In addition, two parallel gels were electro- 
blotted to polyvinylidene difluoride membranes. The membranes were exposed 
to X-ray film, and four intense single spots were excised from each membrane 
and subjected to amino acid analysis. For these four spots, a mean of 209 ± 4 
cpm/pmol of protein/methionine was found. This number was used to quantitate 
all remaining spots in conjunction with the number of methionines present in the 
protein. 
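
The conversion from scintillation counts to protein amount follows from the specific activity above. The Python sketch below shows the arithmetic. The constant 209 cpm/pmol of protein per methionine comes from the text; the function names and the `cells_loaded` parameter are hypothetical, since this section does not state how many cells correspond to the protein loaded on a gel.

```python
CPM_PER_PMOL_PER_MET = 209.0  # mean specific activity reported in the text
AVOGADRO = 6.02214076e23      # molecules per mole


def pmol_protein(spot_cpm, n_methionines):
    """Picomoles of protein in a spot, given its counts and methionine content."""
    if n_methionines <= 0:
        # Proteins with no methionine in the predicted ORF cannot be
        # quantified with this label (footnote d of Table 1).
        raise ValueError("protein has no methionine; cannot quantify")
    return spot_cpm / (CPM_PER_PMOL_PER_MET * n_methionines)


def copies_per_cell(spot_cpm, n_methionines, cells_loaded):
    """Molecules per cell; cells_loaded is a hypothetical, user-supplied value."""
    molecules = pmol_protein(spot_cpm, n_methionines) * 1e-12 * AVOGADRO
    return molecules / cells_loaded


# Example: a spot counting 2,090 cpm from a protein with 5 methionines.
print(pmol_protein(2090, 5))
```

Dividing by the methionine count is what makes spots from methionine-rich and methionine-poor proteins comparable on a molar basis.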

To ensure that proteins were labeled to equilibrium, parallel 2D gels were 
prepared and run on yeast metabolically labeled for 1, 2, 6, or 18 h. The 
corresponding 156 spots were excised from each gel, and radioactivity was mea- 
sured by liquid scintillation counting for each spot. Calculated protein levels were 
highly reproducible for all time points measured after 1 h. 

Calculation of codon bias and predicted half-life. Codon bias values were 
extracted from the YPD spreadsheet (17). Protein half-lives were calculated 
based on the N-end rule (33). When the N-terminal processing was not known 
experimentally, it was predicted based on the affinity of methionine aminopep- 
tidase (31). 

RESULTS 

Characteristics of proteome approach. Nearly every facet of 
proteome analysis hinges on the unambiguous identification of 
large numbers of expressed proteins in cells. Several tech- 
niques have been described previously for the identification of 
proteins separated by 2DE, including N-terminal and internal 
sequencing (1, 2), amino acid analysis (38), and more recently 
mass spectrometry (25). We utilized techniques based on mass 
spectrometry because they afford the highest levels of sensitiv- 
ity and provide unambiguous identification. The specific pro- 
cedure used is schematically illustrated in Fig. 1 and is based 
on three principles. First, proteins are removed from the gel by 



proteolytic in-gel digestion, and the resulting peptides are sep- 
arated by on-line capillary high-performance liquid chromatog- 
raphy. Second, the eluting peptides are ionized and detected, and 
the specific peptide ions are selected and fragmented by the 
mass spectrometer. To achieve this, the mass spectrometer 
switches between the MS mode (for peptide mass identifica- 
tion) and the MS/MS mode (for peptide characterization and 
sequencing). Selected peptides are fragmented by a process 
called collision-induced dissociation (CID) to generate a tan- 
dem mass spectrum (MS/MS spectrum) that contains the pep- 
tide sequence information. Third, individual CID mass spectra 
are then compared by computer algorithms to predicted spec- 
tra from a sequence database. This results in the identification 
of the peptide and, by association, the protein(s) in the spot. 
Unambiguous protein identification is attained in a single anal- 
ysis by the detection of multiple peptides derived from the 
same protein. 

Protein identification. Yeast total cell protein lysate (40 µg), 
metabolically labeled with [35S]methionine, was electrophoretically 
separated by isoelectric focusing in the first dimension 
and by SDS-10% polyacrylamide gel electrophoresis in 
the second dimension. Proteins were visualized by silver stain- 
ing and by autoradiography. Of the more than 1,000 proteins 
visible by silver staining, 156 spots were excised from the gel 
and subjected to in-gel tryptic digestion, and the resulting 
peptides were analyzed and identified by microspray LC- 
MS/MS techniques as described above. The proteins in this 
study were all identified automatically by computer software 
with no human interpretation of mass spectra. They are indi- 
cated in Fig. 2 and detailed in Table 1. 

The CID spectra shown in Fig. 3 indicate that the quality of 
the identification data generated was suitable for unambiguous 
protein identification. The spectra represent the amino acid 
sequences of tryptic peptides NSGDIVNLGSIAGR (Fig. 3A) 
and FAVGAFTDSLR (Fig. 3B). Both peptides were derived 
from protein S57593 (hypothetical protein YMR226C), which 
migrated to spot 114 (molecular weight, 29,156; pI, 6.59) in the 
2D gel in Fig. 2. Five other peptides from the same analysis 
were also computer matched to the same protein sequence. 
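
As a rough consistency check on the two peptides named above, the sketch below computes the mass-to-charge ratio expected for each sequence. It uses standard monoisotopic residue masses and assumes doubly protonated ions; the charge state is an assumption, since the text does not state it, and the helper name is hypothetical. Under those assumptions both values fall within about 0.5 m/z of the precursor values given in the Fig. 3 legend (687.2 and 592.6).

```python
# Standard monoisotopic residue masses (Da) for the residues in the two peptides.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "V": 99.06841,
    "T": 101.04768, "L": 113.08406, "I": 113.08406, "N": 114.04293,
    "D": 115.02694, "F": 147.06841, "R": 156.10111,
}
WATER = 18.01056   # added once per peptide (free N and C termini)
PROTON = 1.00728   # mass of each charge-carrying proton


def peptide_mz(sequence, charge=2):
    """Monoisotopic m/z of a peptide ion; charge=2 is an assumption."""
    neutral = sum(RESIDUE_MASS[aa] for aa in sequence) + WATER
    return (neutral + charge * PROTON) / charge


for peptide in ("NSGDIVNLGSIAGR", "FAVGAFTDSLR"):
    print(peptide, round(peptide_mz(peptide), 2))
```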

Protein and mRNA quantitation. For the 156 genes investi- 
gated, the protein expression levels ranged from 2,200 (PGM2) 
to 863,000 (TDH2/TDH3) copies/cell. The levels of mRNA for 
each of the genes identified were calculated from SAGE fre- 
quency tables (35). These tables contain the mRNA levels for 
4,665 genes in yeast strain YPH499 grown to mid-log phase in 
YPD medium on glucose as a carbon source. In some in- 
stances, the mRNA levels could not be calculated for reasons 
stated in Materials and Methods. For the proteins analyzed in 
this study, mean transcript levels varied from 0.7 to 473 copies/ 
cell. 

Selection of the sample population for mRNA-protein ex- 
pression level correlation. The protein spots selected for iden- 
tification were selected from spots visible by silver staining in 
the 2D gel. An attempt was made not to include spots where 
overlap with other spots was readily apparent. The number of 
proteins identified was 156 (Table 1). Some proteins migrated 
to more than one spot (presumably due to differential protein 
processing or modifications), and protein levels from these 
spots were calculated by integrating the intensities of the dif- 
ferent spots. The 156 protein spots analyzed represented the 
products of 128 different genes. Genes were excluded from the 
correlation analysis only if part of the data set was missing; i.e., 
genes were excluded if (i) no mRNA expression data were 
available for the protein or putative SAGE tags were ambig- 
uous, (ii) the amino acid sequence did not contain methionine, 
(iii) more than a single protein was conclusively identified as 
FIG. 3. Tandem mass (MS/MS) spectra resulting from analysis of a single spot on a 2D gel. The first quadrupole selected a single mass-to-charge ratio (m/z) of 687.2 
(A) or 592.6 (B), while the collision cell was filled with argon gas, and a voltage which caused the peptide to undergo fragmentation by CID was applied. The third 
quadrupole scanned the mass range from 50 to 1,400 m/z. The computer program Sequest (8) was utilized to match MS/MS spectra to amino acid sequence by database 
searching. Both spectra matched peptides from the same protein, S57593 (yeast hypothetical protein YMR226C). Five other peptides from the same analysis were 
matched to the same protein. 



migrating to the same gel spot, or (iv) the theoretical and 
observed pIs and molecular weights could not be reconciled. 
After these criteria were applied, the number of genes used in 
the correlation analysis was 106. 



Codon bias and predicted half-lives. Codon bias is thought 
to be an indicator of protein expression, with highly expressed 
proteins having large codon bias values. The codon bias distri- 
bution for the entire set of more than 6,000 predicted yeast 
gene ORFs is presented in Fig. 4A. The interval with the 
largest frequency of genes is between the codon bias values of 
0.0 and 0.1. This segment contains more than 2,500 genes. The 
distribution of the codon bias values of the 128 different genes 
found in this study (all protein spots from Fig. 2) is shown in 
Fig. 4B, and protein half-lives (predicted from applying the 
N-end rule [33] to the experimentally determined or predicted 
protein N termini) are shown in Fig. 4C. No genes were iden- 
tified with codon bias values less than 0.1 even though thou- 
sands of genes exist in this category. In addition, nearly all of 
the proteins identified had long predicted half-lives (greater 
than 30 h). 

Correlation of mRNA and protein expression levels. The 
correlation between mRNA and protein levels of the genes 
selected as described above is shown in Fig. 5. For the entire 
group (106 genes) for which a complete data set was gener- 
ated, there was a general trend of increased protein levels 
resulting from increased mRNA levels. The Pearson product 
moment correlation coefficient for the whole data set (106 
genes) was 0.935. This number is highly biased by a small 
number of genes with very large protein and message levels. A 
more representative subset of the data is shown in the inset of 
Fig. 5. It shows genes for which the message level was below 10 
copies/cell and includes 69% (73 of 106 genes) of the data used 
in the study. The Pearson product moment correlation coeffi- 
cient for this data set was only 0.356. We also found that levels 
of protein expression coded for by mRNA with comparable 
abundance varied by as much as 30-fold and that the mRNA 
levels coding for proteins with comparable expression levels 
varied by as much as 20-fold. 
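
The effect described above, a high overall coefficient driven by a few very abundant genes, can be reproduced with a small sketch. The data below are made up purely for illustration and are not the values from this study; the function is a plain implementation of the Pearson product moment correlation coefficient.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson product moment correlation coefficient of two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Hypothetical (mRNA copies/cell, protein in thousands of copies/cell) pairs.
low_mrna = [1, 2, 3, 4, 5, 6, 7, 8]
low_protein = [30, 5, 40, 10, 25, 8, 35, 20]
high_mrna = [100, 200, 300]
high_protein = [400, 600, 800]

r_low = pearson(low_mrna, low_protein)
r_all = pearson(low_mrna + high_mrna, low_protein + high_protein)
print(f"low-abundance subset: r = {r_low:.3f}")
print(f"whole data set:       r = {r_all:.3f}")
```

In this toy data set the low-abundance subset is essentially uncorrelated, yet adding three high-abundance points pushes the overall coefficient above 0.9.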

The distortion of the correlation value induced by the un- 
even distribution of the data points along the x axis is further 
demonstrated by the analysis in Fig. 6. The 106 samples in- 
cluded in the study were ranked by protein abundance, and the 
Pearson product moment correlation coefficient was repeat- 
edly calculated after including progressively more, and higher- 
abundance, proteins in each calculation. The correlation values 
remained relatively stable in the range of 0.1 to 0.4 if the 
lowest-expressed 40 to 95 proteins used in this study were 
included. However, the correlation value steadily climbed by 
the inclusion of each of the 11 very highly expressed proteins. 
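
The ranking procedure behind Fig. 6 can be sketched as below. This is an illustrative reconstruction under stated assumptions, not the authors' code: the function names and the data are hypothetical, and the starting subset size is an arbitrary choice.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson product moment correlation coefficient."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def cumulative_correlation(pairs, start=3):
    """Rank (mRNA, protein) pairs by protein level and return (k, r) tuples,
    where r is computed over the k lowest-abundance proteins."""
    ranked = sorted(pairs, key=lambda p: p[1])
    results = []
    for k in range(start, len(ranked) + 1):
        xs = [mrna for mrna, _ in ranked[:k]]
        ys = [protein for _, protein in ranked[:k]]
        results.append((k, pearson(xs, ys)))
    return results


# Made-up data: eight low-abundance genes plus three very abundant ones.
pairs = [(1, 30), (2, 5), (3, 40), (4, 10), (5, 25), (6, 8), (7, 35), (8, 20),
         (100, 400), (200, 600), (300, 800)]
results = cumulative_correlation(pairs)
for k, r in results:
    print(k, round(r, 3))
```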

Correlation of protein and mRNA expression levels with 
codon bias. Codon bias is the propensity for a gene to utilize 
the same codon to encode an amino acid even though other 
codons would insert the identical amino acid in the growing 
polypeptide sequence. It is further thought that highly ex- 
pressed proteins have large codon biases (3). To assess the 
value of codon bias for predicting mRNA and protein levels in 
exponentially growing yeast cells, we plotted the two experi- 
mental sets of data versus the codon bias (Fig. 7). The distri- 
bution patterns for both mRNA and protein levels with respect 
to codon bias were highly similar. There was high variability in 
the data within the codon bias range of 0.8 to 1.0. Although a 
large codon bias generally resulted in higher protein and mes- 
sage expression levels, codon bias did not appear to be predic- 
tive of either protein levels or mRNA levels in the cell. 

DISCUSSION 

The desired end point for the description of a biological 
system is not the analysis of mRNA transcript levels alone but 
also the accurate measurement of protein expression levels and 
their respective activities. Quantitative analysis of global 
mRNA levels currently is a preferred method for the analysis 
of the state of cells and tissues (11). Several methods which 
either provide absolute mRNA abundance (34, 35) or relative 
study. (C) Distribution of identified proteins in this study based on predicted 
mRNA levels in comparative analyses (20, 27) have been described 
elsewhere. The techniques are fast and exquisitely sensitive 
and can provide mRNA abundance for potentially any 
expressed gene. Measured mRNA levels are often implicitly or 
explicitly extrapolated to indicate the levels of activity of the 
corresponding protein in the cell. Quantitative analysis of pro- 
tein expression levels (proteome analysis) is much more time- 
consuming because proteins are analyzed sequentially one by 
one and is not general because analyses are limited to the 
relatively highly expressed proteins. Proteome analysis does, 
however, provide types of data that are of critical importance 
for the description of the state of a biological system and that 
are not readily apparent from the sequence and the level of 
expression of the mRNA transcript. This study attempts to 
examine the relationship between mRNA and protein expres- 
sion levels for a large number of expressed genes in cells 
representing the same state. 

Limits in the sensitivity of current protein analysis technol- 
ogy precluded a completely random sampling of yeast proteins. 
We therefore based the study on those proteins visible by silver 
FIG. 5. Correlation between protein and mRNA levels for 106 genes in yeast growing at log phase with glucose as a carbon source. mRNA and protein levels were 
calculated as described in Materials and Methods. The data represent a population of genes with protein expression levels visible by silver staining on a 2D gel chosen 
to include the entire range of molecular weights, isoelectric focusing points, and staining intensities. The inset shows the low-end portion of the main figure. It contains 
69% of the original data set. The Pearson product moment correlation for the entire data set was 0.935. The correlation for the inset containing 73 proteins (69%) was 
only 0.356. 



staining on a 2D gel. Of the more than 1,000 visible spots, 156 
were chosen to include the entire range of molecular weights, 
isoelectric focusing points, and staining intensities displayed on 
the 2D protein pattern. The genes identified in this study 
shared a number of properties. First, all of the proteins in this 
study had a codon bias of greater than 0.1 and 93% were 
greater than 0.2 (Fig. 4B). Second, with few exceptions, the 
proteins in this study had long predicted half-lives according to 
the N-end rule (Fig. 4C). Third, low-abundance proteins with 
regulatory functions such as transcription factors or protein 
kinases were not identified. 

Because the population of proteins used in this study ap- 
pears to be fairly homogeneous with respect to predicted half- 
life and codon bias, it might be expected that the correlation of 
the mRNA and protein expression levels would be stronger for 
this population than for a random sample of yeast proteins. We 
tested this assumption by evaluating the correlation value if 
different subsets of the available data were included in the 
calculation. The 106 proteins were ranked from lowest to high- 
est protein expression level, and the trend in the correlation 
value was evaluated by progressively including more of the 
higher-abundance proteins in the calculation (Fig. 6). The cor- 
relation value when only the lower-abundance 40 to 93 pro- 
teins were examined was consistently between 0.1 and 0.4. If 
the 11 most abundant proteins were included, the correlation 
steadily increased to 0.94. We therefore expect that the corre- 
lation for all yeast proteins or for a random selection would be 
less than 0.4. The observed level of correlation between 
mRNA and protein expression levels suggests the importance 



of posttranslational mechanisms controlling gene expression. 
Such mechanisms include translational control (15) and con- 
trol of protein half-life (33). Since these mechanisms are also 
active in higher eukaryotic cells, we speculate that there is no 
predictive correlation between steady-state levels of mRNA 
and those of protein in mammalian cells. 

Like other large-scale analyses, the present study has several 
potential sources of error related to the methods used to de- 
termine mRNA and protein expression levels. The mRNA 
levels were calculated from frequency tables of SAGE data. 
This method is highly quantitative because it is based on actual 
sequencing of unique tags from each gene, and the number of 
times that a tag is represented is proportional to the number of 
mRNA molecules for a specific gene. This method has some 
limitations including the following: (i) the magnitude of the 
error in the measurement of mRNA levels is inversely propor- 
tional to the mRNA levels, (ii) SAGE tags from highly similar 
genes may not be distinguished and therefore are summed, (iii) 
some SAGE tags are from sequences in the 3' untranslated 
region of the transcript, (iv) incomplete cleavage at the SAGE 
tag site by the restriction enzyme can result in two tags repre- 
senting one mRNA, and (v) some transcripts actually do not 
generate a SAGE tag (34, 35). 

For the SAGE method, the error associated with a value 
increases with a decreasing number of transcripts per cell. The 
conclusions drawn from this study are dependent on the qual- 
ity of the mRNA levels from previously published data (35). 
Since more than 65% of the mRNA levels included in this 
study were calculated to 10 copies/cell or less (40% were less 
than 4 copies/cell), the error associated with these values may 
be quite large. The mRNA levels were calculated from more 
than 20,000 transcripts. Assuming that the estimate of 15,000 
mRNA molecules per cell is correct (16), this would mean that 
mRNA transcripts present at only a single copy per cell would 
be detected 72% of the time (35). The mRNA levels for each 
gene were carefully scrutinized, and only mRNA levels for 
which a high degree of confidence existed were included in the 
correlation value. 
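
The detection estimate quoted above can be approximated with a simple sampling model. The sketch assumes that SAGE tags are drawn independently and uniformly from the cellular mRNA pool (a Poisson approximation) and uses the round numbers from the text; the function name is hypothetical. With these inputs a single-copy transcript is detected roughly 74% of the time, close to but not identical to the 72% figure cited from reference 35.

```python
from math import exp


def detection_probability(copies_per_cell, tags_sequenced=20_000,
                          mrna_per_cell=15_000):
    """Probability of observing at least one tag for a transcript,
    assuming Poisson sampling of tags from the mRNA pool."""
    expected_tags = copies_per_cell * tags_sequenced / mrna_per_cell
    return 1.0 - exp(-expected_tags)


for copies in (1, 2, 4, 10):
    print(copies, round(detection_probability(copies), 3))
```

The same model shows why the relative error grows as abundance falls: the expected number of tags for a low-copy transcript is only one or two, so sampling noise is large.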

Protein abundance was determined by metabolic radiolabel- 
ing with [ 35 S]methionine. The calculation required knowledge 
of three variables: the number of methionines in the mature 
protein, the radioactivity contained in the protein, and the 
specific activity of the radiolabel normalized per methionine. 
The number of methionines per protein was determined from 
the amino acid sequence of the proteins identified by tandem 
mass spectrometry. For some proteins, it was not known 
whether the methionine of the nascent polypeptide was pro- 
cessed away. The N termini of those proteins were predicted 
based on the specificity of methionine aminopeptidase (31). If 
the N-terminal processing did not conform to the predicted 
specificity of processing enzymes, the calculation of the num- 
ber of methionines would be affected. This discrepancy would 
affect most the quantitation of a protein with a very low num- 
ber of methionines. The average number of calculated methi- 
onines per protein in this study was 7.2. We therefore expect 
the potential for erroneous protein quantitation due to un- 
usual N-terminal processing to be small. 



The amount of radioactivity contained in a single spot might 
be the sum of the radioactivity of comigrating proteins. Be- 
cause protein identification was based on tandem mass spec- 
trometric techniques, comigrating proteins could be identified. 
However, comigrating proteins were rarely detected in this 
study, most likely because relatively small amounts of total 
protein (40 µg) were initially loaded onto the gels, which resulted 
in highly focused spots containing generally 1 to 25 ng of 
protein. Because of the relatively small amount loaded, the 
concentrations of any potentially comigrating protein would 
likely be below the limit of detection of the mass spectrometry 
technique used in this study (1 to 5 ng) and below the limit of 
visualization by silver staining (1 to 5 ng). In the overwhelming 
majority of the samples analyzed, numerous peptides from a 
single protein were detected. It is assumed that any comigrat- 
ing proteins were at levels too low to be detected and that their 
influence in the calculation would be small. 

The specific activity of the radiolabel was determined by 
relating the precise amount of protein present in selected spots 
of a parallel gel, as determined by quantitative amino acid 
composition analysis, to the number of methionines present in 
the sequence of those proteins and the radioactivity deter- 
mined by liquid scintillation counting. It is possible that the 
resulting number might be influenced by unavoidable losses 
inherent in the amino acid analysis procedure applied. Because 
four different proteins were utilized in the calculation and the 
experiment was done in duplicate, the specific activity calcu- 
lated is thought to be highly accurate. Indeed, the specific 
FIG. 7. Relationship between codon bias and protein and mRNA levels in this study. Yeast mRNA and protein expression levels were calculated as described in 
Materials and Methods. The data represent the same 106 genes as in Fig. 5. 



activities calculated for each of the four proteins varied by less 
than 10%. Any inconsistencies in the calculation of the specific 
activity would result in differences in the absolute levels calcu- 
lated but not in the relative numbers and would therefore not 
influence the correlation value determined. 

The protein quantitative method used eliminates a number 
of potential errors inherent in previous methods for the quan- 
titation of proteins separated by 2DE, such as preferential 
protein staining and bias caused by inequalities in the number 
of radiolabeled residues per protein. Any 2D gel-based method 
of quantitation is complicated by the fact that in some cases the 
translation products of the same mRNA migrated to different 
spots. One major reason is posttranslational modification or 
processing of the protein. Also, artifactual proteolysis during 
cell lysis and sample preparation can lead to multiple resolved 
forms of the protein. In such cases, the protein levels of spots 
coded for by the same mRNA were pooled. In addition, the 
existence of other spots coded for by the same mRNA that 
were not analyzed by mass spectrometry or that were below the 
limit of detection for silver staining cannot be ruled out. How- 
ever, since this study is based on a class of highly expressed 
proteins, the presence of undetected minor spots below silver 
staining sensitivity corresponding to a protein analyzed in the 
study would generally cause a relatively small error in protein 
quantitation. 

Codon bias is a measure of the propensity of an organism to 
selectively utilize certain codons which result in the incorpo- 
ration of the same amino acid residue in a growing polypeptide 
chain. There are 61 possible codons that code for 20 amino 
acids. The larger the codon bias value, the smaller the number 
of codons that are used to encode the protein (19). It is 



thought that codon bias is a measure of protein abundance 
because highly expressed proteins generally have large codon 
bias values (3, 13). 
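
The idea that a strongly biased gene draws on fewer distinct codons can be illustrated with a crude count. This is not the codon bias index taken from the YPD spreadsheet; it is only a hypothetical proxy, and both sequences are made up.

```python
def codons_used(orf):
    """Return the set of distinct codons in an in-frame coding sequence."""
    orf = orf.upper()
    if len(orf) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return {orf[i:i + 3] for i in range(0, len(orf), 3)}


# Two made-up sequences encoding the same peptide (Met-Leu-Leu-Leu-Leu).
biased = "ATGTTGTTGTTGTTG"     # one leucine codon reused throughout
unbiased = "ATGTTGCTTCTACTC"   # four different leucine codons

print(len(codons_used(biased)), len(codons_used(unbiased)))
```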

Nearly all of the most highly expressed proteins had codon 
bias values of greater than 0.8. However, we detected a number 
of genes with high codon bias and relatively low protein abundance 
(Fig. 7). For example, the expressed gene with both the 
second largest protein and mRNA levels in the study was 
ENO2_YEAST (775,000 and 289.1 copies/cell, respectively). 
ENO1_YEAST was also present in the gel at much lower 
protein and mRNA levels (44,200 and 0.7 copies/cell, respectively). 
The codon bias values for ENO2 and ENO1 are similar 
(0.96 and 0.93, respectively), but the expression of the two 
genes is differentially regulated. Specifically, ENO1_YEAST is 
glucose repressed (6) and was therefore present in low abundance 
under the conditions used. Other genes with large codon 
bias values that were not of high protein abundance in the gel 
include EFT1, TIF1, HXK2, GSP1, EGD2, SHM2, and TAL1. 
We conclude that merely determining the codon bias of a gene 
is not sufficient to predict its protein expression level. 

Interestingly, codon bias appears to be an excellent indicator 
of the boundaries of current 2D gel proteome analysis tech- 
nology. There are thousands of genes with expressed mRNA 
and likely expressed protein with codon bias values less than 
0.1 (Fig. 4A). In this study, we detected none of them, and only 
a very small percentage of the genes detected in this study had 
codon bias values between 0.1 and 0.2 (Fig. 4B). Indeed, in 
every examined yeast proteome study (5, 7, 13, 28) where the 
combined total number of identified proteins is 300 to 400, this 
same observation is true. It is expected that for the more 
complex cells of higher eukaryotic organisms the detection of 
low-abundance proteins would be even more challenging than 
for yeast. This indicates that highly abundant, long-lived pro- 
teins are overwhelmingly detected in proteome studies. If pro- 
teome analysis is to provide truly meaningful information 
about cellular processes, it must be able to penetrate to the 
level of regulatory proteins, including transcription factors and 
protein kinases. A promising approach is the use of narrow- 
range focusing gels with immobilized pH gradients (IPG) (23). 
This would allow for the loading of significantly more protein 
per pH unit covered and also provide increased resolution of 
proteins with similar electrophoretic mobilities. A standard pH 
gradient in an isoelectric focusing gel covers a 7-pH-unit range 
(pH 3 to 10) over 18 cm. A narrow-range focusing gel might 
expand the range to 0.5 pH units over 18 cm or more. This 
could potentially increase by more than 10-fold the number of 
proteins that can be detected. Clearly, current proteome tech- 
nology is incapable of analyzing low-abundance regulatory pro- 
teins without employing an enrichment method for relatively 
low-abundance proteins. In conclusion, this study examined 
the relationship between yeast protein and message levels and 
revealed that transcript levels provide little predictive value 
with respect to the extent of protein expression. 
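
The resolution gain from a narrow-range gradient follows from simple arithmetic on the figures given above. The sketch below works it through; the function name is hypothetical, and the calculation assumes that resolving power scales with gel length per pH unit.

```python
def cm_per_ph_unit(gel_length_cm, ph_span):
    """Gel length available to each pH unit of the gradient."""
    return gel_length_cm / ph_span


standard = cm_per_ph_unit(18.0, 7.0)   # pH 3 to 10 over 18 cm
narrow = cm_per_ph_unit(18.0, 0.5)     # 0.5 pH units over 18 cm
gain = narrow / standard

print(f"standard: {standard:.2f} cm per pH unit")
print(f"narrow:   {narrow:.2f} cm per pH unit")
print(f"gain:     {gain:.1f}-fold")
```

Under that assumption the gain is 14-fold, which is consistent with the "more than 10-fold" estimate in the text.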
If these minor cell proteins differ among cells to the same extent as the 
more abundant proteins, as is commonly assumed, only a small number of protein 
differences (perhaps several hundred) suffice to create very large differences 
in cell morphology and behavior. 

A Cell Can Change the Expression of Its Genes 
in Response to External Signals 3 

Most of the specialized cells in a multicellular organism are capable of altering 
their patterns of gene expression in response to extracellular cues. If a liver cell 
is exposed to a glucocorticoid hormone, for example, the production of several 
specific proteins is dramatically increased. Glucocorticoids are released during 
periods of starvation or intense exercise and signal the liver to increase the 
production of glucose from amino acids and other small molecules; the set of 
proteins whose production is induced includes enzymes such as tyrosine amino- 
transferase, which helps to convert tyrosine to glucose. When the hormone is no 
longer present, the production of these proteins drops to its normal level. 

Other cell types respond to glucocorticoids in different ways. In fat cells, for 
example, the production of tyrosine aminotransferase is reduced, while some 
other cell types do not respond to glucocorticoids at all. These examples illustrate 
a general feature of cell specialization—different cell types often respond in dif- 
ferent ways to the same extracellular signal. Underlying this specialization are 
features that do not change, which give each cell type its permanently distinc- 
tive character. These features reflect the persistent expression of different sets of 
genes. 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 4 

If differences between the various cell types of an organism depend on the 
particular genes that the cells express, at what level is the control of gene expression 
exercised? There are many steps in the pathway leading from DNA to protein, and 
all of them can in principle be regulated. Thus a cell can control the proteins it 
makes by (1) controlling when and how often a given gene is transcribed (tran- 
scriptional control), (2) controlling how the primary RNA transcript is spliced or 
otherwise processed (RNA processing control), (3) selecting which completed 
mRNAs in the cell nucleus are exported to the cytoplasm (RNA transport control), 
(4) selecting which mRNAs in the cytoplasm are translated by ribosomes 
(translational control), (5) selectively destabilizing certain mRNA molecules in 
the cytoplasm (mRNA degradation control), or (6) selectively activating, inacti- 
vating, or compartmentalizing specific protein molecules after they have been 
made (protein activity control) (Figure 9-2). 

For most genes transcriptional controls are paramount. This makes sense 
because, of all the possible control points illustrated in Figure 9-2, only transcrip- 
tional control ensures that no superfluous intermediates are synthesized. In the 
Figure 9-2 Six steps at which 
eucaryote gene expression can be 
controlled. Only controls that operate 
at steps 1 through 5 are discussed in 
this chapter. The regulation of protein 
activity (step 6) is discussed in 
Chapter 5; this includes reversible 
activation or inactivation by protein 
phosphorylation as well as 
irreversible inactivation by proteolytic 
following sections we discuss the DNA and protein components that regulate the 
initiation of gene transcription. We return at the end of the chapter to the other 
ways of regulating gene expression. 

Summary 

The genome of a cell contains in its DNA sequence the information to make many 
thousands of different protein and RNA molecules. A cell typically expresses only a 
fraction of its genes, and the different types of cells in multicellular organisms arise 
because different sets of genes are expressed. Moreover, cells can change the pattern 
of genes they express in response to changes in their environment, such as signals from 
other cells. Although all of the steps involved in expressing a gene can in principle be 
regulated, for most genes the initiation of RNA transcription is the most important 
point of control. 



DNA-binding Motifs in Gene 
Regulatory Proteins 5 

How does a cell determine which of its thousands of genes to transcribe? As dis- 
cussed in Chapter 8, the transcription of each gene is controlled by a regulatory 
region of DNA near the site where transcription begins. Some regulatory regions 
are simple and act as switches that are thrown by a single signal. Other regula- 
tory regions are complex and act as tiny microprocessors, responding to a vari- 
ety of signals that they interpret and integrate to switch the neighboring gene on 
or off. Whether complex or simple, these switching devices consist of two fun- 
damental types of components: (1) short stretches of DNA of defined sequence 
and (2) gene regulatory proteins that recognize and bind to them. 

We begin our discussion of gene regulatory proteins by describing how these 
proteins were discovered. 



Gene Regulatory Proteins Were Discovered Using 
Bacterial Genetics 6 

Genetic analyses in bacteria carried out in the 1950s provided the first evidence 
of the existence of gene regulatory proteins that turn specific sets of genes on 
or off. One of these regulators, the lambda repressor, is encoded by a bacterial 
virus, bacteriophage lambda. The repressor shuts off the viral genes that code for 
the protein components of new virus particles and thereby enables the viral ge- 
nome to remain a silent passenger in the bacterial chromosome, multiplying with 
the bacterium when conditions are favorable for bacterial growth (see Figure 
6-80). The lambda repressor was among the first gene regulatory proteins to be 
characterized, and it remains one of the best understood, as we discuss later. 
Other bacterial regulators respond to nutritional conditions by shutting off genes 
encoding specific sets of metabolic enzymes when they are not needed. The lac 
repressor, for example, the first of these bacterial proteins to be recognized, turns 
off the production of the proteins responsible for lactose metabolism when this 
sugar is absent from the medium. 

The first step toward understanding gene regulation was the isolation of 
mutant strains of bacteria and bacteriophage lambda that were unable to shut 
off specific sets of genes. It was proposed at the time, and later proved, that most 
of these mutants were deficient in proteins acting as specific repressors for these 
sets of genes. Because these proteins, like most gene regulatory proteins, are 
present in small quantities, it was difficult and time-consuming to isolate them. 
They were eventually purified by fractionating cell extracts on a series of stan- 
dard chromatography columns (see pp. 166-169). Once isolated, the pro- 
teins were shown to bind to specific DNA sequences close to the genes that they 
Summary 

The many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to be transcribed in different cells. Since many spe- 
cialized animal cells can maintain their unique character when grown in culture, the 
gene regulatory mechanisms involved in creating them must be stable once estab- 
lished and heritable when the cell divides, endowing the cell with a memory of its 
developmental history. Procaryotes and yeasts provide unusually accessible model 
systems in which to study gene regulatory mechanisms, some of which may be rel- 
evant to the creation of specialized cell types in higher eucaryotes. One such mecha- 
nism involves a competitive interaction between two (or more) gene regulatory pro- 
teins, each of which inhibits the synthesis of the other; this can create a flip-flop 
switch that switches a cell between two alternative patterns of gene expression. Di- 
rect or indirect positive feedback loops, which enable gene regulatory proteins to 
perpetuate their own synthesis, provide a general mechanism for cell memory.

In eucaryotes gene transcription is generally controlled by combinations of gene
regulatory proteins. It is thought that each type of cell in a higher eucaryotic organism
contains a specific combination of gene regulatory proteins that ensures the
expression of only those genes appropriate to that type of cell. A given gene regulatory
protein may be expressed in a variety of circumstances and typically is involved in the
regulation of many genes.

In addition to diffusible gene regulatory proteins, inherited states of chromatin
condensation are also utilized by eucaryotic cells to regulate gene expression. In
vertebrates DNA methylation also plays a part, mainly as a device to reinforce decisions
about gene expression that are made initially by other mechanisms.



Posttranscriptional Controls 

Although controls on the initiation of gene transcription are the predominant
form of regulation for most genes, other controls can act later in the pathway
from RNA to protein to modulate the amount of gene product that is made.
Although these posttranscriptional controls, which operate after RNA polymerase
has bound to the gene's promoter and begun RNA synthesis, are less common
than transcriptional control, for many genes they are crucial. It seems that every
step in gene expression that could be controlled in principle is likely to be
regulated under some circumstances for some genes.

We consider the varieties of posttranscriptional regulation in temporal
order, according to the sequence of events that might be experienced by an RNA
molecule after its transcription has begun (Figure 9-72).
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Figure 6-3 Genes can be expressed 
with different efficiencies. Gene A is
transcribed and translated much more
efficiently than gene B. This allows the
amount of protein A in the cell to be
much greater than that of protein B.

FROM DNA TO RNA

Transcription and translation are the means by which cells read out, or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from the same gene, and each RNA molecule can direct the synthesis 
of many identical protein molecules, cells can synthesize a large amount of 
protein rapidly when necessary. But each gene can also be transcribed and 
translated with a different efficiency, allowing the cell to make vast quantities of 
some proteins and tiny quantities of others (Figure 6-3). Moreover, as we see in 
the next chapter, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment, most obviously by controlling
the production of its RNA.



Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell takes in reading out a needed part of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence — a gene— into an 
RNA nucleotide sequence. The information in RNA, although copied into another 
chemical form, is still written in essentially the same language as it is in DNA— 
the language of a nucleotide sequence. Hence the name transcription.

Like DNA, RNA is a linear polymer made of four different types of nucleotide
subunits linked together by phosphodiester bonds (Figure 6-4). It differs from
DNA chemically in two respects: (1) the nucleotides in RNA are
ribonucleotides, that is, they contain the sugar ribose (hence the name
ribonucleic acid) rather than deoxyribose; (2) although, like DNA, RNA contains the
bases adenine (A), guanine (G), and cytosine (C), it contains the base uracil (U)
instead of the thymine (T) in DNA. Since U, like T, can base-pair by hydrogen-bonding
with A (Figure 6-5), the complementary base-pairing properties
described for DNA in Chapters 4 and 5 apply also to RNA (in RNA, G pairs with 
C, and A pairs with U). It is not uncommon, however, to find other types of base 
pairs in RNA: for example, G pairing with U occasionally. 
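
As a rough illustration of these pairing rules, the following Python sketch derives the RNA sequence complementary to a DNA template and checks whether two RNA bases can pair. The function names and the example sequence are illustrative assumptions and do not come from the text; strand polarity is deliberately ignored.

```python
# Pairing of each DNA template base with the RNA base copied opposite it.
# U takes the place of T in RNA, so a template A is paired with U.
DNA_TO_RNA = {"A": "U", "T": "A", "G": "C", "C": "G"}

# Standard RNA-RNA pairs, plus the occasional G-U pair mentioned in the text.
STANDARD_PAIRS = {frozenset("AU"), frozenset("GC")}
GU_PAIR = frozenset("GU")


def transcribe(template):
    """Return the RNA complementary to a DNA template, base by base.

    Strand polarity is ignored in this sketch: position i of the result
    pairs with position i of the template.
    """
    return "".join(DNA_TO_RNA[base] for base in template.upper())


def can_pair(base1, base2, allow_gu=False):
    """Report whether two RNA bases can form a base pair."""
    pair = frozenset((base1.upper(), base2.upper()))
    return pair in STANDARD_PAIRS or (allow_gu and pair == GU_PAIR)


if __name__ == "__main__":
    print(transcribe("TACGGT"))               # AUGCCA
    print(can_pair("A", "U"))                 # True
    print(can_pair("G", "U"))                 # False
    print(can_pair("G", "U", allow_gu=True))  # True
```

The `allow_gu` switch is kept separate so that the standard A-U and G-C rules stay the default and the less common G-U pair has to be requested explicitly.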

Despite these small chemical differences, DNA and RNA differ quite dra- 
matically in overall structure. Whereas DNA always occurs in cells as a double- 
stranded helix, RNA is single-stranded. RNA chains therefore fold up into a 
variety of shapes, just as a polypeptide chain folds up to form the final shape of 
a protein (Figure 6-6). As we see later in this chapter, the ability to fold into com- 
plex three-dimensional shapes allows some RNA molecules to have structural 
and catalytic functions. 



Transcription Produces RNA Complementary to 
One Strand of DNA 

All of the RNA in a cell is made by DNA transcription, a process that has cer- 
tain similarities to the process of DNA replication discussed in Chapter 5. 
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Figure 6-89 Protein aggregates that cause human disease. (A) Schematic illustration of the type of 
conformational change in a protein that produces material for a cross-beta filament. (B) Diagram illustrating 
the self-infectious nature of the protein aggregation that is central to prion diseases. PrP is highly unusual 
because the misfolded version of the protein, called PrP*, induces the normal PrP protein it contacts to 
change its conformation, as shown. Most of the human diseases caused by protein aggregation are caused by 
the overproduction of a variant protein that is especially prone to aggregation, but because this structure is
not infectious in this way, it cannot spread from one animal to another. (C) Drawing of a cross-beta filament,
a common type of protease-resistant protein aggregate found in a variety of human neurological diseases.
Because the hydrogen-bond interactions in a β sheet form between polypeptide backbone atoms (see Figure
3-9), a number of different abnormally folded proteins can produce this structure. (D) One of several
possible models for the conversion of PrP to PrP*, showing the likely change of two α-helices into four
β-strands. Although the structure of the normal protein has been determined accurately, the structure of the
infectious form is not yet known with certainty because the aggregation has prevented the use of standard
structural techniques. (C, courtesy of Louise Serpell, adapted from M. Sunde et al., J. Mol. Biol. 273:729-739,
1997; D, adapted from S.B. Prusiner, Trends Biochem. Sci. 21:482-487, 1996.)



animals and humans. It can be dangerous to eat the tissues of animals that con- 
tain PrP*, as witnessed most recently by the spread of BSE (commonly referred 
to as the "mad cow disease") from cattle to humans in Great Britain.

Fortunately, in the absence of PrP*, PrP is extraordinarily difficult to convert 
to its abnormal form. Although very few proteins have the potential to misfold 
into an infectious conformation, a similar transformation has been discovered 
to be the cause of an otherwise mysterious "protein-only inheritance" observed 
in yeast cells. 



There Are Many Steps From DNA to Protein 

We have seen so far in this chapter that many different types of chemical reac- 
tions are required to produce a properly folded protein from the information 
contained in a gene (Figure 6-90). The final level of a properly folded protein in 
a cell therefore depends upon the efficiency with which each of the many steps 
is performed. 

We discuss in Chapter 7 that cells have the ability to change the levels of
their proteins according to their needs. In principle, any or all of the steps in
Figure 6-90 could be regulated by the cell for each individual protein. However, as
we shall see in Chapter 7, the initiation of transcription is the most common
point for a cell to regulate the expression of each of its genes. This makes sense,
inasmuch as the most efficient way to keep a gene from being expressed is to
block the very first step, the transcription of its DNA sequence into an RNA
molecule.

Figure 6-90 The production of a
protein by a eucaryotic cell. The final
level of each protein in a eucaryotic cell
depends upon the efficiency of each step
depicted.



Summary 

The translation of the nucleotide sequence of an mRNA molecule into protein takes 
place in the cytoplasm on a large ribonucleoprotein assembly called a ribosome. The 
amino acids used for protein synthesis are first attached to a family of tRNA 
molecules, each of which recognizes, by complementary base-pair interactions, par- 
ticular sets of three nucleotides in the mRNA (codons). The sequence of nucleotides in 
the mRNA is then read from one end to the other in sets of three according to the 
genetic code. 

To initiate translation, a small ribosomal subunit binds to the mRNA molecule 
at a start codon (AUG) that is recognized by a unique initiator tRNA molecule. A 
large ribosomal subunit binds to complete the ribosome and begin the elongation 
phase of protein synthesis. During this phase, aminoacyl tRNAs, each bearing a
specific amino acid, bind sequentially to the appropriate codon in mRNA by forming
complementary base pairs with the tRNA anticodon. Each amino acid is added to the
C-terminal end of the growing polypeptide by means of a cycle of three sequential
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Figure 7-5 Six steps at which 
eucaryotic gene expression can be
controlled. Controls that operate at
steps 1 through 5 are discussed in this
chapter. Step 6, the regulation of protein
activity, includes reversible activation or
inactivation by protein phosphorylation
(discussed in Chapter 3) as well as
irreversible inactivation by proteolytic
degradation (discussed in Chapter 6).



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 

If differences among the various cell types of an organism depend on the partic- 
ular genes that the cells express, at what level is the control of gene expression 
exercised? As we saw in the last chapter, there are many steps in the pathway 
leading from DNA to protein, and all of them can in principle be regulated. Thus 
a cell can control the proteins it makes by (1) controlling when and how often a 
given gene is transcribed (transcriptional control), (2) controlling how the RNA 
transcript is spliced or otherwise processed (RNA processing control), (3)
selecting which completed mRNAs in the cell nucleus are exported to the cytosol
and determining where in the cytosol they are localized (RNA transport and
localization control), (4) selecting which mRNAs in the cytoplasm are translated
by ribosomes (translational control), (5) selectively destabilizing certain mRNA
molecules in the cytoplasm (mRNA degradation control), or (6) selectively
activating, inactivating, degrading, or compartmentalizing specific protein
molecules after they have been made (protein activity control) (Figure 7-5).

For most genes transcriptional controls are paramount. This makes sense 
because, of all the possible control points illustrated in Figure 7-5, only tran- 
scriptional control ensures that the cell will not synthesize superfluous interme- 
diates. In the following sections we discuss the DNA and protein components 
that perform this function by regulating the initiation of gene transcription. We 
shall return at the end of the chapter to the additional ways of regulating gene 
expression. 

Summary 

The genome of a cell contains in its DNA sequence the information to make many
thousands of different protein and RNA molecules. A cell typically expresses only a
fraction of its genes, and the different types of cells in multicellular organisms arise
because different sets of genes are expressed. Moreover, cells can change the pattern
of genes they express in response to changes in their environment, such as signals
from other cells. Although all of the steps involved in expressing a gene can in
principle be regulated, for most genes the initiation of RNA transcription is the most
important point of control.



DNA-BINDING MOTIFS IN GENE REGULATORY 
PROTEINS 

How does a cell determine which of its thousands of genes to transcribe? As 
mentioned briefly in Chapters 4 and 6, the transcription of each gene is con- 
trolled by a regulatory region of DNA relatively near the site where transcription 
begins. Some regulatory regions are simple and act as switches that are thrown 
by a single signal. Many others are complex and act as tiny microprocessors, 
responding to a variety of signals that they interpret and integrate to switch the
neighboring gene on or off. Whether complex or simple, these switching devices 
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occur in the germ line, the cell lineage that gives rise to sperm or eggs. Most of 
the DNA in vertebrate germ cells is inactive and highly methylated. Over long 
periods of evolutionary time, the methylated CG sequences in these inactive
regions have presumably been lost through spontaneous deamination events
that were not properly repaired. However, promoters of genes that remain active
in the germ cell lineages (including most housekeeping genes) are kept
unmethylated, and therefore spontaneous deaminations of Cs that occur within
them can be accurately repaired. Such regions are preserved in modern-day
vertebrate cells as CG islands. In addition, any mutation of a CG sequence in the
genome that destroyed the function or regulation of a gene in the adult would be 
selected against, and some CG islands are simply the result of a higher than nor- 
mal density of critical CG sequences. 

The mammalian genome contains an estimated 20,000 CG islands. Most of 
the islands mark the 5' ends of transcription units and thus, presumably, of 
genes. The presence of CG islands often provides a convenient way of identify- 
ing genes in the DNA sequences of vertebrate genomes. 



Summary 

The many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to be transcribed in different cells. Since many 
specialized animal cells can maintain their unique character through many cell 
division cycles and even when grown in culture, the gene regulatory mechanisms 
involved in creating them must be stable once established and heritable when the 
cell divides. These features endow the cell with a memory of its developmental history. 
Bacteria and yeasts provide unusually accessible model systems in which to study 
gene regulatory mechanisms. One such mechanism involves a competitive interac- 
tion between two gene regulatory proteins, each of which inhibits the synthesis of the 
other; this can create a flip-flop switch that switches a cell between two alternative 
patterns of gene expression. Direct or indirect positive feedback loops, which enable 
gene regulatory proteins to perpetuate their own synthesis, provide a general mech- 
anism for cell memory. Negative feedback loops with programmed delays form the 
basis for cellular clocks. 
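
The flip-flop switch described in this summary can be sketched as a toy numerical model. The code below is only an illustration under stated assumptions: the rate equations (a Hill-type repression term with exponent 2), the `strength` parameter, and the initial concentrations are all hypothetical choices made for the example, not values taken from the text.

```python
def simulate_switch(x0, y0, strength=4.0, steps=5000, dt=0.01):
    """Integrate two mutually repressing regulatory proteins with Euler steps.

    Assumed toy equations (not from the text):
        dx/dt = strength / (1 + y**2) - x
        dy/dt = strength / (1 + x**2) - y
    Each protein's synthesis is inhibited by the other, and each decays
    at a rate proportional to its own concentration.
    """
    x, y = x0, y0
    for _ in range(steps):
        dx = strength / (1.0 + y * y) - x
        dy = strength / (1.0 + x * x) - y
        x, y = x + dt * dx, y + dt * dy
    return x, y


if __name__ == "__main__":
    # Two starting conditions that differ only in which protein has a head start.
    print(simulate_switch(2.0, 0.1))  # first protein ends high, second low
    print(simulate_switch(0.1, 2.0))  # the mirror-image pattern
```

With these assumed parameters the model settles into one of two stable patterns depending on which protein starts ahead, and it stays there, which is the sense in which such a circuit provides a memory of an earlier state.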

In eucaryotes the transcription of a gene is generally controlled by combinations 
of gene regulatory proteins. It is thought that each type of cell in a higher eucaryotic 
organism contains a specific combination of gene regulatory proteins that ensures 
the expression of only those genes appropriate to that type of cell. A given gene
regulatory protein may be active in a variety of circumstances and typically is involved
in the regulation of many genes. 

In addition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also used by eucaryotic cells to regulate gene expression. An espe- 
cially dramatic case is the inactivation of an entire X chromosome in female mam- 
mals. In vertebrates DNA methylation also functions in gene regulation, being used 
mainly as a device to reinforce decisions about gene expression that are made ini- 
tially by other mechanisms. DNA methylation also underlies the phenomenon of 
genomic imprinting in mammals, in which the expression of a gene depends on 
whether it was inherited from the mother or the father. 



[Figure 7-86 labels: VERTEBRATE ANCESTOR DNA; RNA; methylation of most CG sequences in germ line; many millions of years of evolution; VERTEBRATE DNA; CG island]

Figure 7-86 A mechanism to explain
both the marked overall deficiency
of CG sequences and their clustering
into CG islands in vertebrate
genomes. A black line marks the location
of a CG dinucleotide in the DNA
sequence, while a red "lollipop" indicates
the presence of a methyl group on the
CG dinucleotide. CG sequences that lie in
regulatory sequences of genes that are
transcribed in germ cells are unmethylated
and therefore tend to be retained in
evolution. Methylated CG sequences, on
the other hand, tend to be lost through
deamination of 5-methyl C to T, unless the
CG sequence is critical for survival.



POSTTRANSCRIPTIONAL CONTROLS 

In principle, every step required for the process of gene expression could be 
controlled. Indeed, one can find examples of each type of regulation, although 
any one gene is likely to use only a few of them. Controls on the initiation of 
gene transcription are the predominant form of regulation for most genes. But 
other controls can act later in the pathway from DNA to protein to modulate 
the amount of gene product that is made. Although these posttranscriptional
controls, which operate after RNA polymerase has bound to the gene's promoter
and begun RNA synthesis, are less common than transcriptional control, for
many genes they are crucial.

Regulation of transcription

The phenotypic differences that distinguish the
various kinds of cells in a higher eukaryote are
largely due to differences in the expression of
genes that code for proteins, that is, those
transcribed by RNA polymerase II. In principle, the
expression of these genes might be regulated at
any one of several stages. The concept of the
"level of control" implies that gene expression
is not necessarily an automatic process once it
has begun. It could be regulated in a gene-specific
way at any one of several sequential
steps. We can distinguish (at least) five potential
control points, forming the series:

Activation of gene structure
↓
Initiation of transcription
↓
Processing the transcript
↓
Transport to cytoplasm
↓
Translation of mRNA

The existence of the first step is implied by
the discovery that genes may exist in either of
two structural conditions. Relative to the state
of most of the genome, genes are found in
an "active" state in the cells in which they
are expressed (see Chapter [illegible]). The change of
structure is distinct from the act of transcription,
and indicates that the gene is "transcribable."
This suggests that acquisition of the
"active" structure must be the first step in gene
expression.

Transcription of a gene in the active state is
controlled at the stage of initiation, that is, by
the interaction of RNA polymerase with its promoter.
This is now becoming susceptible to
analysis in the in vitro systems (see Chapter
[illegible]). For most genes, this is a major control
point; probably it is the most common level of
regulation.

There is at present no evidence for control
at subsequent stages of transcription in eukaryotic
cells, for example, via antitermination
mechanisms.

The primary transcript is modified by capping
at the 5' end, and usually also by polyadenylation
at the 3' end. Introns must be spliced out
from the transcripts of interrupted genes. The
mature RNA must be exported from the nucleus
to the cytoplasm. Regulation of gene expression
by selection of sequences at the level of nuclear
RNA might involve any or all of these stages,
but the one for which we have most evidence
concerns changes in splicing; some genes are
expressed by means of alternative splicing patterns
whose regulation controls the type of protein
product (see Chapter 30).

Finally, the translation of an mRNA in the cytoplasm
can be specifically controlled. There is little
evidence for the employment of this mechanism in
adult somatic cells, but it does occur in some
embryonic situations, as described in Chapter 7.
The mechanism is presumed to involve the blocking
of initiation of translation of some mRNAs by
specific protein factors.

But having acknowledged that control of gene
expression can occur at multiple stages, and
that production of RNA cannot inevitably be
equated with production of protein, it is clear
that the overwhelming majority of regulatory
events occur at the initiation of transcription.
Regulation of tissue-specific gene transcription
lies at the heart of eukaryotic differentiation;
indeed, we see examples in Chapter 38 in
which proteins that regulate embryonic development
prove to be transcription factors. A regulatory
transcription factor serves to provide
common control of a large number of target
genes, and we seek to answer two questions
about this mode of regulation: what identifies
the common target genes to the transcription
factor, and how is the activity of the transcription
factor itself regulated in response to intrinsic
or extrinsic signals?



Response elements identify genes under common 
regulation 



The principle that emerges from characterizing
groups of genes under common control is that
they share a promoter element that is recognized
by a regulatory transcription factor. An element
that causes a gene to respond to such a factor
is called a response element; examples are the
HSE (heat shock response element), GRE
(glucocorticoid response element), and SRE (serum
response element).

The properties of some inducible transcription
factors and the elements that they recognize are
summarized in Table 29.1. Response elements
have the same general characteristics as
upstream elements of promoters or enhancers.
They contain short consensus sequences, and
copies of the response elements found in different
genes are closely related, but not necessarily
identical. The region bound by the factor
extends for a short distance on either side of



Table 29.1 Inducible transcription factors bind to
response elements that identify groups of promoters
or enhancers subject to coordinate control.

Regulatory Agent   Module   Consensus          Factor
Heat shock         HSE      CNNGAANNTCCNNG     HSTF
Glucocorticoid     GRE      TGGTACAAATGTTCT
Phorbol ester      TRE      TGACTCA            AP1
Serum              SRE      CCATATTAGG         SRF



the consensus sequence. In promoters, the elements
are not present at fixed distances from
the start point, but are usually <200 bp upstream
of it. The presence of a single element usually
is sufficient to confer the regulatory response,
but sometimes there are multiple copies.
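
To make the idea of a short consensus sequence concrete, the sketch below scans a DNA sequence for matches to a consensus in which N stands for any base. It is a minimal illustration only: the function names and the test sequences are hypothetical, the consensus strings are the HSE and TRE entries as read from Table 29.1 after repairing the scan, and only one strand is searched.

```python
import re


def consensus_to_regex(consensus):
    """Translate a consensus such as 'CNNGAANNTCCNNG' into a regular expression.

    'N' stands for any of the four bases; every other letter must match exactly.
    """
    return "".join("[ACGT]" if base == "N" else base for base in consensus.upper())


def find_elements(sequence, consensus):
    """Return the 0-based start positions of all (possibly overlapping) matches."""
    # The lookahead (?=...) lets matches overlap instead of consuming the sequence.
    pattern = re.compile("(?=" + consensus_to_regex(consensus) + ")")
    return [match.start() for match in pattern.finditer(sequence.upper())]


if __name__ == "__main__":
    hse = "CNNGAANNTCCNNG"  # assumed reading of the HSE consensus in Table 29.1
    tre = "TGACTCA"         # TRE consensus from the same table
    print(find_elements("GGCTAGAAGCTCCATGAA", hse))  # [2]
    print(find_elements("AATGACTCAGG", tre))         # [2]
```

Because copies of a response element in different genes are related but not necessarily identical, an exact-match search like this one would miss near matches; a more realistic search would tolerate a small number of mismatches or score positions with a weight matrix.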

Response elements may be located in promoters
or in enhancers. Some types of elements
are typically found in one rather than the other:
usually an HSE is found in a promoter, while a
GRE is found in an enhancer. We assume that
all response elements function by the same
general principle. A gene is regulated by a
sequence at the promoter or enhancer that is
recognized by a specific protein. The protein
functions as a transcription factor needed for
RNA polymerase to initiate. Active protein is
available only under conditions when the gene is
to be expressed; its absence means that the
promoter is not activated by this particular circuit.

An example of a situation in which many
genes are controlled by a single factor is
provided by the heat shock response. This is
common to a wide range of prokaryotes and
eukaryotes and involves multiple controls of
gene expression: an increase in temperature
turns off transcription of some genes, turns on
transcription of the heat shock genes, and
causes changes in the translation of mRNAs.
The control of the heat shock genes illustrates
the differences between prokaryotic and
eukaryotic modes of control. In bacteria, a new
sigma factor is synthesized that directs RNA
polymerase holoenzyme to recognize an

World Journal of Surgical Oncology | Full text | Prostate stem cell antigen (PSCA) expression in human ... Page 1 of 11





16-Nov-2004 




Prostate stem cell antigen (PSCA) expression in 
human prostate cancer tissues and its potential role 
in prostate carcinogenesis and progression of prostate 
cancer 

Zhao Zhigang 1 and Shen Wenlv 2

1 Department of Urology, Shantou University Medical College, Shantou, Guangdong, China
2 Department of Urology, No 2. Affiliated Hospital of Shantou University Medical College, Shantou,
Guangdong, China

World Journal of Surgical Oncology 2004, 2:13 doi:10.1186/1477-7819-2-13

The electronic version of this article is the complete one and can be found online at: 

.httpj//_w^ 

Received 30 March 2004 
Accepted 10 May 2004 
Published 10 May 2004 

© 2004 Zhigang and Wenlv; licensee BioMed Central Ltd. This is an Open Access article: 
verbatim copying and redistribution of this article are permitted in all media for any purpose, 
provided this notice is preserved along with the article's original URL. 



Keywords: Prostate, Neoplasm, Prostate stem cell antigen (PSCA) 

Abstract 
Background 

Prostate stem cell antigen (PSCA) is a recently defined homologue of the Thy-1/Ly-6
family of glycosylphosphatidylinositol (GPI)-anchored cell surface antigens. The
purpose of the present study was to examine the expression status of PSCA protein
and mRNA in clinical specimens of human prostate cancer (Pca) and to validate it
as a potential molecular target for diagnosis and treatment of Pca.

Materials and Methods 

Immunohistochemical (IHC) and in situ hybridization (ISH) analyses of PSCA
expression were simultaneously performed on paraffin-embedded sections from 20
benign prostatic hyperplasia (BPH), 20 prostatic intraepithelial neoplasm (PIN) and
48 prostate cancer (Pca) tissues, including 9 androgen-independent prostate
cancers. The level of PSCA expression was semiquantitatively scored by assessing
both the percentage and intensity of PSCA-positive staining cells in the specimens.
PSCA expression was then compared between BPH, PIN and Pca tissues, and
the correlations of PSCA expression level with pathological grade, clinical stage and
progression to androgen-independence in Pca were analysed.

Results 

In BPH and low grade PIN, PSCA protein and mRNA staining were weak or negative
and less intense and uniform than that seen in HGPIN and Pca. There was
moderate to strong PSCA protein and mRNA expression in 8 of 11 (72.7%) HGPIN
and in 40 of 48 (83.4%) Pca specimens examined by IHC and ISH analyses, with
statistical significance compared with BPH (20%) and low grade PIN (22.2%)
samples (p < 0.05, respectively). The expression level of PSCA increased with high 
Gleason grade, advanced stage and progression to androgen-independence (p < 
0.05, respectively). In addition, IHC and ISH staining showed a high degree of 
correlation between PSCA protein and mRNA overexpression. 

Conclusions 

Our data demonstrate that PSCA, as a new cell surface marker, is overexpressed by
a majority of human Pca. PSCA expression correlates positively with adverse tumor
characteristics, such as increasing pathological grade (poor cell differentiation),
worsening clinical stage and androgen-independence, and speculatively with
prostate carcinogenesis. PSCA protein overexpression results from upregulated
transcription of PSCA mRNA. PSCA may have prognostic utility and may be a
promising molecular target for diagnosis and treatment of Pca.

Prostate cancer (Pca) is the second leading cause of cancer-related death in
American men and is becoming increasingly common in China. Despite
great recent progress in the diagnosis and management of localized disease,
there continues to be a need for new diagnostic markers that can accurately
discriminate between indolent and aggressive variants of Pca. There also continues
to be a need for the identification and characterization of potential new therapeutic
targets on Pca cells. Current diagnostic and therapeutic modalities for recurrent
and metastatic Pca have been limited by a lack of specific target antigens of Pca.



Although a number of prostate-specific genes have been identified (i.e. prostate 
specific antigen, prostatic acid phosphatase, glandular kallikrein 2), the majority of 
these are secreted proteins not ideally suited for many immunological strategies. 
The identification of new cell surface antigens is therefore critical to the development of
new diagnostic and therapeutic approaches to the management of Pca.

Reiter RE et al [1] reported the identification of prostate stem cell antigen (PSCA),
a cell surface antigen that is predominantly prostate specific. The PSCA gene
encodes a 123 amino acid glycoprotein, with 30% homology to stem cell antigen 2
(Sca-2). Like Sca-2, PSCA is a member of the Thy-1/Ly-6 family and
is anchored by a glycosylphosphatidylinositol (GPI) linkage. mRNA in situ
hybridization (ISH) localized PSCA expression in normal prostate to the basal cell 
epithelium, the putative stem cell compartment of prostatic epithelium, suggesting 
that PSCA may be a marker of prostate stem/progenitor cells. 



In order to examine the status of PSCA protein and mRNA expression in human Pca
and validate it as a potential diagnostic and therapeutic target for Pca, we used
immunohistochemistry (IHC) and in situ hybridization (ISH) simultaneously, and
conducted PSCA protein and mRNA expression analyses in paraffin-embedded
tissue specimens of benign prostatic hyperplasia (BPH, n = 20), prostate
intraepithelial neoplasm (PIN, n = 20) and prostate cancer (Pca, n = 48).
Furthermore, we evaluated the possible correlation of PSCA expression level with
Pca tumorigenesis, grade, stage and progression to androgen-independence.
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Tables 
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Correlation of PSCA 
expression with Gleason 
score

Table 2

Correlation of PSCA
expression with clinical
stage



Tissue samples 

All of the clinical tissue specimens studied herein were obtained from 80 patients
aged 57-84 years by prostatectomy, transurethral resection of prostate (TURP) or
biopsy. The patients were classified as 20 cases of BPH, 20 cases of PIN and 40 cases
of primary Pca, including 9 patients with recurrent Pca and a history of androgen
ablation therapy (orchiectomy and/or hormonal therapy), who were referred to as
androgen-independent prostate cancers. Eight specimens were harvested from
these androgen-independent Pca patients prior to androgen ablation treatment.
Each tissue sample was cut into two parts: one was fixed in 10% formalin for IHC
and the other was treated with 4% paraformaldehyde/0.1 M PBS pH 7.4 in 0.1% DEPC
for 1 h for ISH analysis, and then embedded in paraffin. All paraffin blocks
examined were then cut into 5 μm sections and mounted on the glass slides
specific for IHC and ISH respectively in the usual fashion. An H&E-stained section of
each Pca was evaluated and assigned a Gleason score by the experienced
urological pathologist at our institution based on the criteria of Gleason score [2].
The Gleason sums are summarized in Table 1. Clinical staging was performed
according to the Jewett-Whitmore-Prout staging system, as shown in Table 2. In the
category of PIN, we graded the specimens into two groups, i.e. low grade PIN
(grade I - II) and high grade PIN (HGPIN, grade III), on the basis of the literature
[3,4].

Immunohistochemical (IHC) analysis 

Briefly, tissue sections were deparaffinized, dehydrated, and subjected to
microwaving in 10 mmol/L citrate buffer, pH 6.0 (Boshide, Wuhan, China) in a 900
W oven for 5 min to induce epitope retrieval. Slides were allowed to cool at room
temperature for 30 min. A primary mouse antibody specific to human PSCA
(Boshide, Wuhan, China) at a 1:100 dilution was incubated with the
slides at room temperature for 2 h. Labeling was detected by sequentially adding
biotinylated secondary antibodies and streptavidin-peroxidase, and localized using
the 3,3'-diaminobenzidine reaction. Sections were then counterstained with
hematoxylin. Substitution of the primary antibody with phosphate-buffered saline
(PBS) served as a negative-staining control.

mRNA in situ hybridization (ISH) 

Five-μm-thick tissue sections were deparaffinized and dehydrated, then digested in
pepsin solution (4 mg/ml in 3% citric acid) for 20 min at 37.5°C, and further
processed for ISH. Digoxigenin-labeled sense and antisense human PSCA RNA
probes (obtained from Boshide, Wuhan, China) were hybridized to the sections at
48°C overnight. The high-stringency posthybridization wash was performed
sequentially at 37°C in 2 x standard saline citrate (SSC) for 10 min, in 0.5 x SSC
for 15 min and in 0.2 x SSC for 30 min. The slides were then incubated with
biotinylated mouse anti-digoxigenin antibody at 37.5°C for 1 h followed by washing
in 1 x PBS for 20 min at room temperature, and then with streptavidin-peroxidase at
37.5°C for 20 min followed by washing in 1 x PBS for 15 min at room temperature.
Subsequently, the slides were developed with diaminobenzidine and then
counterstained with hematoxylin to localize the hybridization signals. Sections
hybridized with the sense control probes routinely did not show any specific
hybridization signal above background. Slides hybridized with PBS in place of
the probes served as a negative control.

Scoring methods 

To determine the correlation between the results of PSCA immunostaining and
mRNA in situ hybridization, the same scoring method was used in the present
study for PSCA protein staining by IHC and PSCA mRNA staining by ISH. Each slide
was read and scored independently by two experienced urological pathologists
using Olympus BX-41 light microscopes. The evaluation was done in a blinded
fashion. For each section, five areas of similar grade were analyzed
semiquantitatively for the fraction of cells staining. Fifty percent of specimens were
randomly chosen and rescored to determine the degree of interobserver and
intraobserver concordance. There was greater than 95% intra- and interobserver
agreement.

The intensity of PSCA expression evaluated microscopically was graded on a scale
of 0 to 3+, with 3+ being the highest expression observed (0, no staining; 1+, mildly
intense; 2+, moderately intense; 3+, strongly intense). The staining density was
quantified as the percentage of cells staining positive for PSCA with the primary
antibody or hybridization probe, as follows: 0 = no staining; 1 = positive staining in
<25% of the sample; 2 = positive staining in 25%-50% of the sample; 3 =
positive staining in >50% of the sample. The intensity score (0 to 3+) was multiplied
by the density score (0-3) to give an overall score of 0-9 [1,5]. In this way, we
were able to differentiate specimens that may have had focal areas of increased
staining from those that had diffuse areas of increased staining [6]. The overall
score for each specimen was then categorically assigned to one of the following
groups: score 0, negative expression; scores 1-2, weak expression; scores 3-6,
moderate expression; score 9, strong expression.
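
The composite scoring rule described above can be illustrated with a short
sketch. It is only an illustration under stated assumptions: the function names
composite_score and expression_category are hypothetical and do not come from
the study, and the percentage boundaries follow the text as written (<25%,
25%-50%, >50%).

```python
def composite_score(intensity: int, percent_positive: float) -> int:
    """Return intensity (0-3) multiplied by the density score (0-3)."""
    if intensity not in (0, 1, 2, 3):
        raise ValueError("intensity must be 0, 1, 2 or 3")
    if not 0 <= percent_positive <= 100:
        raise ValueError("percent_positive must be between 0 and 100")
    # Density score from the percentage of positively staining cells.
    if percent_positive == 0:
        density = 0
    elif percent_positive < 25:
        density = 1
    elif percent_positive <= 50:
        density = 2
    else:
        density = 3
    return intensity * density


def expression_category(score: int) -> str:
    """Map an overall score to the categories named in the text."""
    # A product of two 0-3 scores can only be 0, 1, 2, 3, 4, 6 or 9.
    if score == 0:
        return "negative"
    if score in (1, 2):
        return "weak"
    if score in (3, 4, 6):
        return "moderate"
    if score == 9:
        return "strong"
    raise ValueError("not a valid product of two 0-3 scores")


# Hypothetical specimen: intensity 3+ with 60% of cells staining.
print(expression_category(composite_score(3, 60)))
```

Because the overall score is a product, the values 5, 7 and 8 cannot occur,
which is why the categories jump from 3-6 directly to 9.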

Statistical analysis 

Intensity and density of PSCA protein and mRNA expression in BPH, PIN and PCa
tissues were compared using the Chi-square test and Student's t-test. Univariate
associations between PSCA expression and Gleason score, clinical stage and
progression to androgen-independence were calculated using Fisher's Exact Test.
For all analyses, p < 0.05 was considered statistically significant.
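
As a rough illustration of how a two-sided Fisher's exact test on a 2 x 2 table
is computed, the sketch below sums hypergeometric probabilities using only the
Python standard library. The function name and the example counts are
hypothetical and are not taken from the study; the software and exact procedure
actually used by the authors are not stated in the text.

```python
from math import comb


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of every table with the same
    margins that is no more likely than the observed table.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    total_tables = comb(row1 + row2, col1)

    def table_probability(x: int) -> float:
        # Probability that the top-left cell equals x given fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / total_tables

    observed = table_probability(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    p_value = 0.0
    for x in range(lowest, highest + 1):
        p = table_probability(x)
        if p <= observed * (1 + 1e-9):  # tolerance for floating-point ties
            p_value += p
    return p_value


# Hypothetical counts (not from the study): rows are two patient groups,
# columns are "overall score 9" versus "overall score below 9".
p = fisher_exact_two_sided(9, 1, 2, 8)
print(f"p = {p:.4f}; significant at the 0.05 level: {p < 0.05}")
```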



Results

PSCA expression in BPH




In general, PSCA protein and mRNA were expressed weakly in individual samples of
BPH. Some areas of prostate expressed weak levels (composite score 1-2),
whereas other areas were completely negative (composite score 0). Four cases
(20%) of BPH had moderate expression of PSCA protein and mRNA (composite
score 4-6) by IHC and ISH. In 2/20 (10%) BPH specimens, PSCA mRNA expression
was moderate (composite score 3-6), but PSCA protein expression was weak
(composite score 2) in one and negative (composite score 0) in the other. PSCA
expression was localized to the basal and secretory epithelial cells, and the
prostatic stroma stained almost negative for PSCA protein and mRNA in all cases
examined.




Figure 1

Representative PSCA IHC and ISH staining in PCa (A, IHC staining; B, ISH
staining; x200 magnification)



PSCA expression in PIN 

In this study, we detected weak or negative expression of PSCA protein and mRNA
(≤2 scores) in 7 of 9 (77.8%) low grade PIN and in 2 of 11 (18.2%) HGPIN, and
moderate expression (3-6 scores) in the remaining 2 low grade PIN and 5 of 11 (45.5%)
HGPIN. One HGPIN with moderate PSCA mRNA expression (6 score) showed
weak staining for PSCA protein (2 score) by IHC. Strong PSCA protein and mRNA
expression (9 score) was detected in the remaining 3 of 11 (27.3%) HGPIN. There
was a statistically significant difference in PSCA protein and mRNA expression
levels between HGPIN and BPH (p < 0.05), but no statistically significant
difference between low grade PIN and BPH (p > 0.05).



PSCA expression in PCa

In order to determine if PSCA protein and mRNA can be detected in prostate
cancers and if PSCA expression levels are increased in malignant compared with
benign glands, forty-eight paraffin-embedded PCa specimens were analysed by IHC
and ISH. It was shown that 19 of 48 (39.6%) PCa samples stained very strongly for
PSCA protein and mRNA with a score of 9 and another 21 (43.8%) specimens
displayed moderate staining with scores of 4-6 (Figure 1). In addition, 4 specimens
with moderate to strong PSCA mRNA expression (scores of 4-9) had weak protein
staining (a score of 2) by IHC analyses. Overall, PCa expressed a significantly
higher level of PSCA protein and mRNA than any other specimen category in this
study (p < 0.05, compared with BPH and PIN respectively). The result
demonstrates that PSCA protein and mRNA are overexpressed by a majority of
human PCa.



Correlation of PSCA expression with Gleason score in PCa

Using the semi-quantitative scoring method described in Materials and Methods,
we compared the expression level of PSCA protein and mRNA with the Gleason grade
of PCa, as shown in Table 1. Prostate adenocarcinomas were graded by Gleason
score as 2-4 = well differentiated, 5-7 = moderately differentiated
and 8-10 = poorly differentiated [7]. Seventy-two percent of prostate cancers
with Gleason scores of 8-10 had very strong staining of PSCA, compared to 21% with
Gleason scores of 5-7 and 17% with scores of 2-4, demonstrating that poorly
differentiated PCa had significantly stronger expression of PSCA protein and mRNA
than moderately and well differentiated tumors (p < 0.05). As depicted in Figure 1,
IHC and ISH analyses showed that PSCA protein and mRNA expression in several
cases of poorly differentiated PCa was particularly prominent, with more intense
and uniform staining. The results indicate that PSCA expression increases
significantly with higher tumor grade in human PCa.

Correlation of PSCA expression with clinical stage in PCa

The results for PSCA expression at each stage of PCa are shown in
Table 2. Seventy-five percent of locally advanced and node positive cancers (i.e.
C-D stages) expressed statistically high levels of PSCA versus 32.5% that were organ
confined (i.e. A-B stages) (p < 0.05). The data demonstrate that PSCA expression
increases significantly with advanced tumor stage in human PCa.

Correlation of PSCA expression with androgen-independent
progression of PCa

All 9 specimens of androgen-independent prostate cancers stained positive for
PSCA protein and mRNA. Eight specimens were obtained from patients managed
prior to androgen ablation therapy. Seven of eight (87.5%) of these
androgen-independent prostate cancers were in the strongest staining category
(score = 9), compared with three out of eight (37.5%) of patients with
androgen-dependent cancers (p < 0.05). The results demonstrate that PSCA
expression increases significantly with progression to androgen-independence
of human PCa.

It is evident from the results above that within a majority of human prostate 
cancers the level of PSCA protein and mRNA expression correlates significantly with 
increasing grade, worsening stage and progression to androgen-independence. 

Correlation of PSCA immunostaining and mRNA in situ 
hybridization 

In all 88 specimens surveyed herein, we compared the results of PSCA IHC staining
with mRNA ISH analysis. Positive staining areas and their intensity and density scores
evaluated by IHC were identical to those seen by ISH in 79 of 88 (89.8%)
specimens (18/20 BPH, 19/20 PIN and 42/48 PCa respectively). Importantly, 27/27
samples with PSCA mRNA composite scores of 0-2, 32/36 samples with scores of
3-6 and 22/24 samples with a score of 9 also had PSCA protein expression scores
of 0-2, 3-6 and 9 respectively. However, in 5 samples with PSCA mRNA overall
scores of 3-6 and in 2 with scores of 9 there was weaker or negative PSCA protein
expression (i.e. scores of 0-4), suggesting that this may reflect posttranscriptional
modification of PSCA or that the epitopes recognized by PSCA mAb may be
obscured in some cancers. The data demonstrate that the results of PSCA
immunostaining were consistent with those of mRNA ISH analysis, showing a high
degree of correlation between PSCA protein and mRNA expression.
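
The overall IHC/ISH agreement quoted above follows from the per-tissue counts
given in the text. The short sketch below reproduces that arithmetic; the
dictionary layout and the function name concordance_percent are illustrative
assumptions, not part of the study.

```python
# (identical IHC/ISH results, total specimens) per tissue type, as reported.
concordant = {"BPH": (18, 20), "PIN": (19, 20), "PCa": (42, 48)}


def concordance_percent(counts: dict) -> float:
    """Percentage of specimens whose IHC and ISH results were identical."""
    identical = sum(same for same, _ in counts.values())
    total = sum(n for _, n in counts.values())
    return 100.0 * identical / total


overall = concordance_percent(concordant)
print(f"Overall concordance: {overall:.1f}%")
```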



Discussion

PSCA is homologous to a group of cell surface proteins that mark the earliest phase
of hematopoietic development. PSCA mRNA expression is prostate-specific in
normal male tissues and is highly up-regulated in both androgen-dependent and
androgen-independent PCa xenografts (LAPC-4 tumors). We hypothesize that PSCA
may play a role in PCa tumorigenesis and progression, and may serve as a target for
PCa diagnosis and treatment. In this study, IHC and ISH showed that in general there
was weak or absent PSCA protein and mRNA expression in BPH and low grade PIN
tissues. However, PSCA protein and mRNA are widely expressed in HGPIN, the
putative precursor of invasive PCa, suggesting that up-regulation of PSCA is an
early event in prostate carcinogenesis. Recently, Reiter RE et al [1], using ISH
analysis, reported that 97 of 118 (82%) HGPIN specimens stained strongly positive
for PSCA mRNA. A very similar finding for mouse PSCA (mPSCA)
expression in mouse HGPIN tissues was reported by Tran CP et al [8]. These data
suggest that PSCA may be a new marker associated with transformation of prostate
cells and tumorigenesis.

We observed that PSCA protein and mRNA are highly expressed in a large
percentage of human prostate cancers, including advanced, poorly differentiated,
androgen-independent and metastatic cases. Fluorescence-activated cell sorting
and confocal/immunofluorescent studies demonstrated cell surface expression of
PSCA protein in PCa cells [9]. Our IHC expression analysis of PSCA shows not only
cell surface but also apparent cytoplasmic staining of PSCA protein in PCa
specimens (Figure 1). One possible explanation for this is that anti-PSCA antibody
can recognize PSCA peptide precursors that reside in the cytoplasm. Also, it is
possible that the positive staining that appears in the cytoplasm is actually from
the overlying cell membrane [5]. These data seem to indicate that PSCA is a novel
cell surface marker for human PCa.

Our results show that an elevated level of PSCA expression correlates with high grade
(i.e. poor differentiation), increased tumor stage and progression to
androgen-independence of PCa. These findings support the original IHC analyses by
Gu Z et al [9], who reported that PSCA protein was expressed in 94% of primary PCa
and that the intensity of PSCA protein expression increased with tumor grade, stage
and progression to androgen-independence. Our results also corroborate the recent
work of Han KR et al [10], in which a significant association between high PSCA
expression and adverse prognostic features such as high Gleason score, seminal
vesicle invasion and capsular involvement in PCa was found. It is suggested that
PSCA overexpression may be an adverse predictor for recurrence, clinical
progression or survival of PCa. Hara H et al [11] used RT-PCR detection of PSA,
PSMA and PSCA in 1 ml of peripheral blood to evaluate PCa patients with poor
prognosis. The results showed that among 58 PCa patients, each PCR indicated the
prognostic value in the hierarchy of PSCA>PSA>PSMA RT-PCR, and extraprostatic
cases with positive PSCA PCR indicated lower disease-progression-free survival
than those with negative PSCA PCR, demonstrating that PSCA can be used as a
prognostic factor. Dubey P et al [12] reported that elevated numbers of PSCA+
cells correlate positively with the onset and development of prostate carcinoma
over a long time span in the prostates of the TRAMP and PTEN +/- models
compared with normal prostates. Taken together with our present findings, in
which PSCA is overexpressed from HGPIN to almost frank carcinoma, it is
reasonable to use an increased PSCA expression level or increased
numbers of PSCA-positive cells in prostate samples as a prognostic marker to
predict the potential onset of this cancer. These data raise the possibility that PSCA
may have diagnostic utility or clinical prognostic value in human PCa.

The cause of PSCA overexpression in PCa is not known. One possible mechanism is
that it may result from PSCA gene amplification. In humans, PSCA is located on
chromosome 8q24.2 [1], which is often amplified in metastatic and recurrent PCa
and considered to indicate a poor prognosis [13-15]. Interestingly, PSCA is in close
proximity to the c-myc oncogene, which is amplified in >20% of recurrent and
metastatic prostate cancers [16,17]. Reiter RE et al [18] reported that PSCA and
MYC gene copy numbers were co-amplified in 25% of tumors (five out of twenty),
demonstrating that PSCA overexpression is associated with PSCA and MYC
coamplification in PCa. Gu Z et al [9] recently reported that in 102 specimens
available to compare the results of PSCA immunostaining with their previous mRNA
ISH analysis, 92 (90.2%) had identically positive areas of PSCA protein and mRNA
expression. Taken together with our findings, in which we detected moderate to
strong expression of PSCA protein and mRNA in 34 of 40 (85%) PCa specimens
examined simultaneously by IHC and ISH analyses, it is demonstrated that PSCA
protein and mRNA are overexpressed in human PCa, and that the increased protein
level of PSCA resulted from the upregulated transcription of its mRNA.

At present, the regulatory mechanisms of human PSCA expression and its biological
function are yet to be elucidated. PSCA expression may be regulated by multiple
factors [18]. Watabe T et al [19] reported that transcriptional control is a major
component regulating PSCA expression levels. In addition, induction of PSCA
expression may be regulated or mediated through cell-cell contact and protein
kinase C (PKC) [20]. Homologues of PSCA have diverse activities, and have
themselves been involved in carcinogenesis. Signalling through SCA-2 has been
demonstrated to prevent apoptosis in immature thymocytes [21]. Thy-1 is involved
in T-cell activation and transduces signals through src-like tyrosine kinases [22].
Ly-6 genes have been implicated both in tumorigenesis and in cell-cell adhesion
[23-25]. Cell-cell or cell-matrix interaction is critical for local tumor growth and
spread to distal sites. From its restricted expression in basal cells of normal
prostate and its homology to SCA-2, PSCA may play a role in stem/progenitor cell
function, such as self-renewal (i.e. anti-apoptosis) and/or proliferation [1]. Taken
together with the results in the present study, we speculate that PSCA may play a
role in tumorigenesis and clinical progression of PCa through affecting cell
transformation and proliferation. From our results, it is also suggested that PSCA as
a new cell surface antigen may have a number of potential uses in the diagnosis,
therapy and clinical prognosis of human PCa. PSCA overexpression in prostate
biopsies could be used to identify patients at high risk of developing recurrent or
metastatic disease, and to discriminate cancers from normal glands in
prostatectomy samples. Similarly, the detection of PSCA-overexpressing cells in
bone marrow or peripheral blood may identify and predict metastatic progression
better than current assays, which identify only PSA-positive or PSMA-positive
prostate cells.

In summary, we have shown in this study that PSCA protein and mRNA expression is
maintained from HGPIN through all stages of PCa in a majority of
cases, which may be associated with prostate carcinogenesis and correlate
positively with high tumor grade (poor cell differentiation), advanced stage and
androgen-independent progression. PSCA protein overexpression is due to the
upregulation of its mRNA transcription. The results suggest that PSCA may be a
promising molecular marker for the clinical prognosis of human PCa and a valuable
target for diagnosis and therapy of this tumor.

Introduction

The fundamental principle of molecular therapeutics in cancer
is to exploit the differences in gene expression between
cancer cells and normal cells. With the advent of cDNA array
technology, most efforts have concentrated on identifying
differences in gene expression at the level of mRNA, which
can be attributable either to DNA amplification or to
differences in transcription. Gene expression is quite complicated,
however, and is also regulated at the level of mRNA stability,
mRNA translation, and protein stability.

The power of translational regulation has been best recognized
among developmental biologists, because transcription
does not occur in early embryogenesis in eukaryotes. For
example, in Xenopus, the period of transcriptional quiescence
continues until the embryo reaches midblastula transition, the
4000-cell stage. Therefore, all necessary mRNA molecules are
transcribed during oogenesis and stockpiled in a translationally
inactive, masked form. The mRNA are translationally activated
at appropriate times during oocyte maturation, fertilization, and
early embryogenesis and thus, are under strict translational
control.

Translation has an established role in cell growth. Basically,
an increase in protein synthesis occurs as a consequence
of mitogenesis. Until recently, however, little was
known about the alterations in mRNA translation in cancer,
and much is yet to be discovered about their role in the
development and progression of cancer. Here we review the
basic principles of translational control, the alterations
encountered in cancer, and selected therapies targeting
translation initiation to elucidate potential new therapeutic avenues.

Translation initiation is the main step in translational regulation.
Translation initiation [...]
tRNA and the 40S and 60S ribosomal subunits are recruited to
the 5' end of a mRNA molecule and assembled by eukaryotic
translation initiation factors into an 80S ribosome at the start
codon of the mRNA (Fig. 1). The 5' end of eukaryotic mRNA is
capped, i.e., contains the cap structure m7GpppN
(7-methylguanosine-triphospho-5'-ribonucleoside). Most translation in
eukaryotes occurs in a cap-dependent fashion, i.e., the cap is
specifically recognized by the eIF4E, which binds the 5' cap.
The eIF4F translation initiation complex is then formed by the
assembly of eIF4E, the RNA helicase eIF4A, and eIF4G, a
scaffolding protein that mediates the binding of the 40S
ribosomal subunit to the mRNA molecule through interaction with
the eIF3 protein present on the 40S ribosome. eIF4A and eIF4B
participate in melting the secondary structure of the 5' UTR of
the mRNA. The 43S initiation complex (40S/eIF2/Met-tRNA/GTP
complex) scans the mRNA in a 5'→3' direction until it
encounters an AUG start codon. This start codon is then
base-paired to the anticodon of initiator tRNA, forming the 48S
initiation complex. The initiation factors are then displaced from the
48S complex, and the 60S ribosome joins to form the 80S
ribosome.

Unlike most eukaryotic translation, translation initiation of
certain mRNAs, such as the picornavirus RNA, is cap independent
and occurs by internal ribosome entry. This mechanism
does not require eIF4E. Either the 43S complex can bind
the initiation codon directly through interaction with the IRES in
the 5' UTR such as in the encephalomyocarditis virus, or it can
initially attach to the IRES and then reach the initiation codon by
scanning or transfer, as is the case with the poliovirus (1).

Fig. 1. Translation initiation in eukaryotes. The 4E-BPs are
hyperphosphorylated to release eIF4E so that it can interact with the 5' cap,
and the eIF4F initiation complex is assembled. The interaction of poly(A)
binding protein with the initiation complex and circularization of the mRNA
is not depicted in the diagram. The secondary structure of the 5' UTR is
melted, the 40S ribosomal subunit is bound to eIF3, and the ternary complex
consisting of eIF2, GTP, and the Met-tRNA are recruited to the mRNA. The
ribosome scans the mRNA in a 5'→3' direction until an AUG start codon
is found in the appropriate sequence context. The initiation factors are
released, and the large ribosomal subunit is recruited.

Regulation of Translation Initiation
Translation initiation can be regulated by alterations in the
expression or phosphorylation status of the various factors
involved. Key components in translational regulation that
may provide potential therapeutic targets follow.

eIF4E. eIF4E plays a central role in translation regulation.
It is the least abundant of the initiation factors and is
considered the rate-limiting component for initiation of
cap-dependent translation. eIF4E may also be involved in mRNA
splicing, mRNA 3' processing, and mRNA nucleocytoplasmic
transport (2). eIF4E expression can be increased at the
transcriptional level in response to serum or growth factors
(3). eIF4E overexpression may cause preferential translation
of mRNAs containing excessive secondary structure in their
5' UTR that are normally discriminated against by the
translational machinery and thus are inefficiently translated (4-7).
As examples of this, overexpression of eIF4E promotes
increased translation of vascular endothelial growth factor,
fibroblast growth factor-2, and cyclin D1 (2, 8, 9).

Another mechanism of control is the regulation of eIF4E
phosphorylation. eIF4E phosphorylation is mediated by the
mitogen-activated protein kinase-interacting kinase 1, which
is activated by the mitogen-activated pathway activating
extracellular signal-related kinases and the stress-activated
pathway acting through p38 mitogen-activated protein
kinase (10-13). Several mitogens, such as serum,
platelet-derived growth factor, epidermal growth factor, insulin,
angiotensin II, src kinase overexpression, and ras
overexpression, lead to eIF4E phosphorylation (14). The
phosphorylation status of eIF4E is usually correlated with the
translational rate and growth status of the cell; however,
eIF4E phosphorylation has also been observed in response
to some cellular stresses when translational rates actually
decrease (15). Thus, further study is needed to understand
the effects of eIF4E phosphorylation on eIF4E activity.

Another mechanism of regulation is the alteration of eIF4E
availability by the binding of eIF4E to the eIF4E-binding
proteins (4E-BP, also known as PHAS-I). 4E-BPs compete with
eIF4G for a binding site in eIF4E. The binding of eIF4E to the
best characterized eIF4E-binding protein, 4E-BP1, is
regulated by 4E-BP1 phosphorylation. Hypophosphorylated
4E-BP1 binds to eIF4E, whereas 4E-BP1 hyperphosphorylation
decreases this binding. Insulin, angiotensin, epidermal
growth factor, platelet-derived growth factor, hepatocyte
growth factor, nerve growth factor, insulin-like growth factors
I and II, interleukin 3, granulocyte-macrophage
colony-stimulating factor + steel factor, gastrin, and the adenovirus have
all been reported to induce phosphorylation of 4E-BP1 and
to decrease the ability of 4E-BP1 to bind eIF4E (15, 16).
Conversely, deprivation of nutrients or growth factors results
in 4E-BP1 dephosphorylation, an increase in eIF4E binding,
and a decrease in cap-dependent translation.

p70 S6 Kinase. Phosphorylation of ribosomal 40S protein
S6 by S6K is thought to play an important role in translational
regulation. S6K-/- mouse embryonic cells proliferate more
slowly than do parental cells, demonstrating that S6K has a
positive influence on cell proliferation (17). S6K regulates the
translation of a group of mRNAs possessing a 5' terminal
oligopyrimidine tract (5' TOP) found at the 5' UTR of ribosomal
protein mRNAs and other mRNAs coding for components of
the translational machinery. Phosphorylation of S6K is
regulated in part based on the availability [...]
stimulated by several growth factors, such as platelet-derived
growth factor and insulin-like growth factor I (20).

eIF2α Phosphorylation. The binding of the initiator tRNA
to the small ribosomal unit is mediated by translation
initiation factor eIF2. Phosphorylation of the α-subunit of eIF2
prevents formation of the eIF2/GTP/Met-tRNA complex and
inhibits global protein synthesis (21, 22). eIF2α is
phosphorylated under a variety of conditions, such as viral infection,
nutrient deprivation, heme deprivation, and apoptosis (22).
eIF2α is phosphorylated by heme-regulated inhibitor,
nutrient-regulated protein kinase, and the IFN-induced,
double-stranded RNA-activated protein kinase (PKR; Ref. 23).



The mTOR Signaling Pathway. The macrolide antibiotic
rapamycin (Sirolimus; Wyeth-Ayerst Research, Collegeville,
PA) has been the subject of intense study because it
inhibits signal transduction pathways involved in T-cell
activation. The rapamycin-sensitive component of these pathways
is mTOR (also called FRAP or RAFT1). mTOR is the
mammalian homologue of the yeast TOR proteins that regulate G1
progression and translation in response to nutrient
availability (24). mTOR is a serine-threonine kinase that modulates
translation initiation by altering the phosphorylation status of
4E-BP1 and S6K (Fig. 2; Ref. 25).

4E-BP1 is phosphorylated on multiple residues. mTOR
phosphorylates the Thr-37 and Thr-46 residues of 4E-BP1 in vitro
(26); however, phosphorylation at these sites is not associated
with a loss of eIF4E binding. Phosphorylation of Thr-37 and
Thr-46 is required for subsequent phosphorylation at several
COOH-terminal, serum-sensitive sites; a combination of these
phosphorylation events appears to be needed to inhibit the
binding of 4E-BP1 to eIF4E (26). The product of the ATM gene,
p38/MSK1 pathway, and protein kinase Co- also play a role in
4E-BP1 phosphorylation (27-29).

S6K and 4E-BP1 are also regulated, in part, by PI3K and its
downstream protein kinase Akt. PTEN is a phosphatase that
negatively regulates PI3K signaling. PTEN null cells have
constitutively active Akt, with increased S6K activity and
S6 phosphorylation (30). S6K activity is inhibited both by
PI3K inhibitors wortmannin and LY294002 and by mTOR
inhibitor rapamycin (24). Akt phosphorylates Ser-2448 in
mTOR in vitro, and this site is phosphorylated upon Akt
activation in vivo (31-33). Thus, mTOR is regulated by the
PI3K/Akt pathway; however, this does not appear to be the
only mode of regulation of mTOR activity. Whether the PI3K
pathway also regulates S6K and 4E-BP1 phosphorylation
independent of mTOR is controversial.

Interestingly, mTOR autophosphorylation is blocked by
wortmannin but not by rapamycin (34). This seeming inconsistency
suggests that mTOR-responsive regulation of 4E-BP1 and S6K
activity occurs through a mechanism other than intrinsic mTOR
kinase activity. An alternate pathway for 4E-BP1 and S6K
phosphorylation by mTOR activity is by the inhibition of a
phosphatase. Treatment with calyculin A, an inhibitor of phosphatases 1
and 2A, reduces rapamycin-induced dephosphorylation of
4E-BP1 and S6K (35). PP2A interacts with full-length
S6K but not with a S6K mutant that is resistant to
dephosphorylation resulting from rapamycin. mTOR phosphorylates PP2A
in vitro; however, how this process alters PP2A activity is not
known. These results are consistent with the model that
phosphorylation of a phosphatase by mTOR prevents
dephosphorylation of 4E-BP1 and S6K, and conversely, that nutrient
deprivation and rapamycin block inhibition of the phosphatase by
mTOR.

Polyadenylation. The poly(A) tail in eukaryotic mRNA is
important in enhancing translation initiation and mRNA
stability. Polyadenylation plays a key role in regulating gene
expression during oogenesis and early embryogenesis.
Some mRNA that are translationally inactive in the oocyte are
polyadenylated concomitantly with translational activation in
oocyte maturation, whereas other mRNAs that are
translationally active during oogenesis are deadenylated and
translationally silenced (36-38). Thus, control of poly(A) tail
synthesis is an important regulatory step in gene expression.
The 5' cap and poly(A) tail are thought to function
synergistically to regulate mRNA translational efficiency (39, 40).

RNA Packaging. Most RNA-binding proteins are assembled
on a transcript at the time of transcription, thus
determining the translational fate of the transcript (41). A highly
conserved family of Y-box proteins is found in cytoplasmic
messenger ribonucleoprotein particles, where the proteins
are thought to play a role in restricting the recruitment of
mRNA to the translational machinery (41-43). The major
mRNA-associated protein, YB-1, destabilizes the interaction
of eIF4E and the 5' mRNA cap in vitro, and overexpression of
YB-1 results in translational repression in vivo (44). Thus,
alterations in RNA packaging can also play an important role
in translational regulation.

Translation Alterations Encountered in Cancer
Three main alterations at the translational level occur in cancer:
variations in mRNA sequences that increase or decrease
translational efficiency, changes in the expression or availability of
components of the translational machinery, and activation of
translation through aberrantly activated signal transduction
pathways. The first alteration affects the translation of an
individual mRNA that may play a role in carcinogenesis. The
second and third alterations can lead to more global changes, such
as an increase in the overall rate of protein synthesis, and the
translational activation of several mRNA species.

Variations in mRNA Sequence
Variations in mRNA sequence affect the translational
efficiency of the transcript. A brief description of these variations
and examples of each mechanism follow.

Mutations. Mutations in the mRNA sequence, especially
in the 5' UTR, can alter its translational efficiency, as seen in
the following examples.

c-myc. Saito et al. proposed that translation of full-length
c-myc is repressed, whereas in several Burkitt lymphomas
that have deletions of the mRNA 5' UTR, translation of c-myc
is more efficient (46). More recently, it was reported that the
5' UTR of c-myc contains an IRES, and thus c-myc translation
can be initiated by a cap-independent as well as a
cap-dependent mechanism (46, 47). In patients with multiple
myeloma, a C→T mutation in the c-myc IRES was identified
(48) and found to cause an enhanced initiation of translation
via internal ribosomal entry (49).

BRCA1. A somatic point mutation (117 G→C) in position
-3 with respect to the start codon of the BRCA1 gene was
identified in a highly aggressive sporadic breast cancer (50).
Chimeric constructs consisting of the wild-type or mutated
BRCA1 5' UTR and a downstream luciferase reporter
demonstrated a decrease in the translational efficiency with the 5'
UTR mutation.

Cyclin-dependent Kinase Inhibitor 2A. Some inherited
melanoma kindreds have a G→T transversion at base -34
of cyclin-dependent kinase inhibitor-2A, which encodes a
cyclin-dependent kinase 4/cyclin-dependent kinase 6 kinase
inhibitor important in G1 checkpoint regulation (51). This
mutation gives rise to a novel AUG translation initiation
codon, creating an upstream open reading frame that
competes for scanning ribosomes and decreases translation
from the wild-type AUG.
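
How a single base change can introduce an upstream start codon can be
illustrated with a minimal Python sketch. The leader below is an invented
12-nucleotide placeholder, not the actual cyclin-dependent kinase
inhibitor-2A 5' UTR, and the function names are hypothetical. The sketch
only shows a G→T change in the DNA (G→U in the mRNA) converting an AGG
triplet into an AUG.

```python
from typing import List


def find_augs(leader: str) -> List[int]:
    """Return the 0-based position of every AUG triplet in a 5' leader."""
    rna = leader.upper().replace("T", "U")  # accept DNA or RNA letters
    return [i for i in range(len(rna) - 2) if rna[i:i + 3] == "AUG"]


def point_mutation(seq: str, index: int, new_base: str) -> str:
    """Return a copy of seq with the base at 0-based index replaced."""
    return seq[:index] + new_base + seq[index + 1:]


# Hypothetical leader (illustrative only, not the real gene sequence).
wild_type_leader = "GCCAGGCCAGCC"
# G->U change in the mRNA (G->T in the DNA) at 0-based index 4.
mutant_leader = point_mutation(wild_type_leader, 4, "U")

print(find_augs(wild_type_leader))  # no AUG in the wild-type leader
print(find_augs(mutant_leader))     # the mutation creates an AUG
```

In this toy case the wild-type leader has no AUG, whereas the mutant
leader has one that a scanning ribosome would meet before the authentic
start codon.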

Alternate Splicing and Alternate Transcription Sites. Alterations
in splicing and alternate transcription sites can lead to
variations in 5' UTR sequence, length, and secondary structure,
ultimately impacting translational efficiency.

ATM. The ATM gene has four noncoding exons in its 5'
UTR that undergo extensive alternative splicing (52). The
contents of 12 different 5' UTRs that show considerable
diversity in length and sequence have been identified. These
divergent 5' leader sequences play an important role in the
translational regulation of the ATM gene.

mdm2. In a subset of tumors, overexpression of the
oncoprotein mdm2 results in enhanced translation of the mdm2
mRNA. Use of different promoters leads to two mdm2
transcripts that differ only in their 5' leaders (53). The longer 5'
UTR contains two upstream open reading frames, and this
mRNA is loaded with ribosomes inefficiently compared with
the short 5' UTR.
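
One way upstream open reading frames in a leader might be counted is
sketched below in Python. The two sequences are invented placeholders,
not the mdm2 leaders, and `find_uorfs` is a hypothetical helper. It
assumes a uORF is any AUG followed by an in-frame stop codon that lies
entirely within the leader, and it ignores uORFs that run past the end
of the leader.

```python
from typing import List, Tuple

STOP_CODONS = {"UAA", "UAG", "UGA"}


def find_uorfs(leader: str) -> List[Tuple[int, int]]:
    """Return (start, end) spans, 0-based and half-open, of each AUG that
    reaches an in-frame stop codon inside the leader."""
    rna = leader.upper().replace("T", "U")  # accept DNA or RNA letters
    uorfs = []
    for start in range(len(rna) - 2):
        if rna[start:start + 3] != "AUG":
            continue
        # Walk codon by codon from the AUG until the first in-frame stop.
        for pos in range(start + 3, len(rna) - 2, 3):
            if rna[pos:pos + 3] in STOP_CODONS:
                uorfs.append((start, pos + 3))
                break
    return uorfs


# Hypothetical leaders (illustrative only).
short_leader = "GCCGCCACC"
long_leader = "GGAUGCCCUAAGGCAUGGGGUGACC"

print(len(find_uorfs(short_leader)))  # number of uORFs in the short leader
print(len(find_uorfs(long_leader)))   # number of uORFs in the long leader
```

In this toy case the short leader has no uORF and the long leader has
two, the pattern described above for the two mdm2 transcripts.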

BRCA1. In a normal mammary gland, BRCA1 mRNA is
expressed with a shorter leader sequence (5' UTRa), whereas
in sporadic breast cancer tissue, BRCA1 mRNA is expressed
with a longer leader sequence (5' UTRb); the translational
efficiency of transcripts containing 5' UTRb is 10 times lower
than that of transcripts containing 5' UTRa (54).

TGF-β3. TGF-β3 mRNA includes a 1.1-kb 5' UTR, which
exerts an inhibitory effect on translation. Many human breast
cancer cell lines contain a novel TGF-β3 transcript with a 5'
UTR that is 870 nucleotides shorter and has a 7-fold greater
translational efficiency than the normal TGF-β3 mRNA (56).

Alternate Polyadenylation Sites. Multiple polyadenylation
signals leading to the generation of several transcripts
with differing 3' UTRs have been described for several mRNA
species, such as the RET proto-oncogene (56), ATM gene
(52), tissue inhibitor of metalloproteinases-3 (57), RHOA
proto-oncogene (58), and calmodulin-I (59). Although the
effect of these alternate 3' UTRs on translation is not yet
known, they may be important in RNA-protein interactions
that affect translational recruitment. The role of these
alterations in cancer development and progression is unknown.

Alterations in the Components of the Translation Machinery

Alterations in the components of translation machinery can
take many forms.

Overexpression of eIF4E. Overexpression of eIF4E
causes malignant transformation in rodent cells (60) and the
deregulation of HeLa cell growth (61). Polunovsky et al. (62)
found that eIF4E overexpression substitutes for serum and
individual growth factors in preserving viability of fibroblasts,
which suggests that eIF4E can mediate both proliferative and
survival signaling.

Elevated levels of eIF4E mRNA have been found in a broad
spectrum of transformed cell lines (63). eIF4E levels are
elevated in all ductal carcinoma in situ specimens and
invasive ductal carcinomas, compared with benign breast
specimens evaluated with Western blot analysis (64, 65).
Preliminary studies suggest that this overexpression is attributable
to gene amplification (66).

eIF4E overexpression can be valuable as a prognostic marker.
eIF4E overexpression was found in a retrospective study to be
an indicator of poor prognosis in stages I to III breast
carcinoma (67). Verification of the prognostic value of eIF4E in
breast cancer is now under way in a prospective trial (67).
However, in a different study, eIF4E expression was correlated
with the aggressive behavior of non-Hodgkin's lymphomas (68).
In a prospective analysis of patients with head and neck cancer,
elevated levels of eIF4E in histologically tumor-free surgical
margins predicted a significantly increased risk of
local-regional recurrence (9). These results all suggest that
eIF4E overexpression can be used to select patients who might
benefit from more aggressive systemic therapy. Furthermore, the
head and neck cancer results suggest that eIF4E overexpression
is a field defect and can be used to guide local therapy.

Alterations in Other Initiation Factors. Alterations in a
number of other initiation factors have been associated with
cancer. Overproduction of eIF4G, similar to eIF4E, leads to
malignant transformation in vitro (69). eIF-2α is found in
increased levels in bronchioloalveolar carcinomas of the lung
(3). Initiation factor eIF-4A1 is overexpressed in melanoma
(70) and hepatocellular carcinoma (71). The p40 subunit of
translation initiation factor 3 is amplified and overexpressed
in breast and prostate cancer (72), and the eIF3-p110 subunit
is overexpressed in testicular seminoma (73). The role that
overexpression of these initiation factors plays in the
development and progression of cancer, if any, is not known.

Overexpression of S6K. S6K is amplified and highly
overexpressed in the MCF7 breast cancer cell line,
compared with normal mammary epithelium (74). In a study by
Barlund et al. (74), S6K was amplified in 59 of 668 primary
breast tumors, and a statistically significant association was
observed between amplification and poor prognosis.



Overexpression of PAP. PAP catalyzes 3' poly(A)
synthesis. PAP is overexpressed in human cancer cells
compared with normal and virally transformed cells (75). PAP
enzymatic activity in breast tumors has been correlated with
PAP protein levels (76) and, in mammary tumor cytosols, was
found to be an independent factor for predicting survival (76).
Little is known, however, about how PAP expression or
activity affects the translational profile.

Alterations in RNA Packaging. Even less is known
about alterations in RNA packaging in cancer. Increased
expression and nuclear localization of the RNA-binding protein
YB-1 are indicators of a poor prognosis for breast cancer (77),
non-small cell lung cancer (78), and ovarian cancer (79).
However, this effect may be mediated at least in part at the level of
transcription, because YB-1 increases chemoresistance by
enhancing the transcription of a multidrug resistance gene (80).



Activation of Signal Transduction Pathways
Activation of signal transduction pathways by loss of tumor
suppressor genes or overexpression of certain tyrosine kinases
can contribute to the growth and aggressiveness of tumors. An
important mutant in human cancers is the tumor suppressor
gene PTEN, which leads to the activation of the PI3K
pathway. Activation of PI3K and Akt induces the oncogenic
transformation of chicken embryo fibroblasts. The transformed cells
show constitutive phosphorylation of S6K and of 4E-BP1 (81).
A mutant Akt that retains kinase activity but does not
phosphorylate S6K or 4E-BP1 does not transform fibroblasts, which
suggests a correlation between the oncogenicity of PI3K and
Akt and the phosphorylation of S6K and 4E-BP1 (81).

Several tyrosine kinases such as platelet-derived growth
factor, insulin-like growth factor, HER2/neu, and epidermal
growth factor receptor are overexpressed in cancer.
Because these kinases activate downstream signal transduction
pathways known to alter translation initiation, activation
of translation is likely to contribute to the growth and
aggressiveness of these tumors. Furthermore, the mRNAs for many
of these kinases themselves are under translational control.
For example, HER2/neu mRNA is translationally controlled
both by a short upstream open reading frame that represses
HER2/neu translation in a cell type-independent manner and
by a distinct cell type-dependent mechanism that increases
translational efficiency (82). HER2/neu translation is different
in transformed and normal cells. Thus, it is possible that
alterations at the translational level can in part account for
the discrepancy between HER2/neu gene amplification
detected by fluorescence in situ hybridization and protein levels
detected by immunohistochemical assays.



Translation Targets of Selected Cancer Therapy
Components of the translation machinery and signal
pathways involved in the activation of translation initiation
represent good targets for cancer therapy.




Rapamycin inhibits the proliferation of lymphocytes. It was
initially developed as an immunosuppressive drug for organ
transplantation. Rapamycin with FKBP 12 (FK506-binding
protein, Mr 12,000) binds to mTOR to inhibit its function.

Rapamycin causes a small but significant reduction in the
initiation rate of protein synthesis (83). It blocks cell growth in
part by blocking S6 phosphorylation and selectively
suppressing the translation of 5' TOP mRNAs, such as ribosomal
proteins and elongation factors (83-85). Rapamycin also
blocks 4E-BP1 phosphorylation and inhibits cap-dependent
but not cap-independent translation (17, 86).

The rapamycin-sensitive signal transduction pathway,
activated during malignant transformation [...],
is now being studied as a target for cancer therapy [...]
breast, small cell lung, [...]
leukemia are among the cancer lines most sensitive to the
rapamycin analogue CCI-779 (Wyeth-Ayerst Research; Ref.
[...]). In rhabdomyosarcoma [...]
cytostatic or cytocidal, depending on the p53 status of the cell; p53
wild-type cells treated with rapamycin arrest in the G1 phase
and maintain their viability, whereas p53 mutant cells
accumulate in G1 and undergo apoptosis (88, 89). In a recently reported
study using human primitive neuroectodermal tumor and
medulloblastoma models, rapamycin exhibited more
cytotoxicity in combination with cisplatin and camptothecin than as a
single agent. In vivo, CCI-779 delayed growth of xenografts by
160% after 1 week of therapy and 240% after 2 weeks. A single
high-dose administration caused a 37% decrease in tumor
volume. Growth inhibition in vivo was 13 times greater with
cisplatin in combination with CCI-779 than with cisplatin alone
(30). Thus, preclinical studies suggest that rapamycin
analogues are useful as single agents and in combination with
chemotherapy.

Rapamycin analogues CCI-779 and RAD001 (Novartis,
Basel, Switzerland) are now in clinical trials. Because of the
known effect of rapamycin on lymphocyte proliferation, a
potential problem with rapamycin analogues is
immunosuppression. However, although prolonged immunosuppression
can result from rapamycin and CCI-779 administered on
continuous-dose schedules, the immunosuppressive effects
of rapamycin analogues resolve in ~24 h after therapy
(91). The principal toxicities of CCI-779 have included
dermatological toxicity, myelosuppression, infection, mucositis,
diarrhea, reversible elevations in liver function tests,
hyperglycemia, hypokalemia, hypocalcemia, and depression (87,
92-94). Phase II trials of CCI-779 have been conducted in
advanced renal cell carcinoma and in stage III/IV breast
carcinoma patients in whom prior chemotherapy had failed. In
the results reported in abstract form, although there were no
complete responses, partial responses were documented in
both renal cell carcinoma and breast carcinoma (94, 95).
Thus, CCI-779 has documented preliminary clinical activity in
a previously treated, unselected patient population.

Active investigation is underway into patient selection for
mTOR inhibitors. Several studies have found an enhanced
efficacy of CCI-779 in PTEN-null tumors (30, 96). Another
study found that six of eight breast cancer cell lines were
responsive to CCI-779, although only two of these lines
lacked PTEN (97). There was, however, a positive correlation
between Akt activation and CCI-779 sensitivity (97). This
correlation suggests that activation of the PI3K-Akt pathway,
regardless of whether it is attributable to a PTEN mutation or
to overexpression of receptor tyrosine kinases, makes
cancer cells amenable to mTOR-directed therapy. In contrast,
lower levels of the target of mTOR, 4E-BP1, are associated
with rapamycin resistance; thus, a lower 4E-BP1/eIF4E ratio
may predict rapamycin resistance (98).

Another mode of activity for rapamycin and its analogues
appears to be through inhibition of angiogenesis. This
activity may occur both through direct inhibition of endothelial cell
proliferation as a result of mTOR inhibition in these cells and
through inhibition of translation of such proangiogenic factors as
vascular endothelial growth factor in tumor cells (99, 100).

The angiogenesis inhibitor tumstatin, another anticancer
drug currently under study, was also found recently to inhibit
translation in endothelial cells (101). Through a requisite
interaction with integrin, tumstatin inhibits activation of the
PI3K/Akt pathway and mTOR in endothelial cells and
prevents dissociation of eIF4E from 4E-BP1, thereby inhibiting
cap-dependent translation. These findings suggest that
endothelial cells are especially sensitive to therapies targeting
the mTOR-signaling pathway.

Targeting eIF2α: EPA, Clotrimazole, mda-7, and Flavonoids

EPA is an n-3 polyunsaturated fatty acid found in the
fish-based diets of populations having a low incidence of cancer
(102). EPA inhibits the proliferation of cancer cells (103), as
well as in animal models (104, 105). It blocks cell division by
inhibiting translation initiation (105). EPA releases Ca2+ from
intracellular stores while inhibiting their refilling, thereby
activating PKR. PKR, in turn, phosphorylates and inhibits eIF2α,
resulting in the inhibition of protein synthesis at the level of
translation initiation. Similarly, clotrimazole, a potent
antiproliferative agent in vitro and in vivo, inhibits cell growth through
depletion of Ca2+ stores, activation of PKR, and
phosphorylation of eIF2α (106). Consequently, clotrimazole
preferentially decreases the expression of cyclins A, E, and D1,
resulting in blockage of the cell cycle in G1.

mda-7 is a novel tumor suppressor gene being developed
as a gene therapy agent. Adenoviral transfer of mda-7
(Ad-mda7) induces apoptosis in many cancer cells including
breast, colorectal, and lung cancer (107-109). Ad-mda7 also
induces and activates PKR, which leads to phosphorylation
of eIF2α and induction of apoptosis (110).

Flavonoids such as genistein and quercetin suppress
tumor cell growth. All three mammalian eIF2α kinases, PKR,
heme-regulated inhibitor, and PERK/PEK, are activated by
flavonoids, with phosphorylation of eIF2α and inhibition of
protein synthesis (111).



Targeting eIF4A and eIF4E: Antisense RNA and Peptides

Antisense expression of eIF4A decreases the proliferation rate
of melanoma cells (112). Sequestration of eIF4E by
overexpression of 4E-BP1 is proapoptotic and decreases tumorigenicity
(113, 114). Reduction of eIF4E with antisense RNA decreases
soft agar growth, increases tumor latency, and increases the
rates of tumor doubling times (7). Antisense eIF4E RNA
treatment also reduces the expression of angiogenic factors (115)
and has been proposed as a potential adjuvant therapy for head
and neck cancers, particularly [...]
surgical margins. Small molecule inhibitors that bind the
eIF4G/4E-BP1-binding domain of eIF4E are proapoptotic (116) and
are also being actively pursued.



Exploiting Selective Translation for Gene Therapy
A different therapeutic approach that takes advantage of the
enhanced cap-dependent translation in cancer cells is the use
of gene therapy vectors encoding suicide genes with highly
structured 5' UTRs. These mRNAs would thus be at a competitive
disadvantage in normal cells and not translate well, whereas in
cancer cells, they would translate more efficiently. For example,
the introduction of the 5' UTR of fibroblast growth factor-2 5' to
the coding sequence of herpes simplex virus type-1 thymidine
kinase gene allows for selective translation of herpes simplex
virus type-1 thymidine kinase gene in breast cancer cell lines
compared with normal mammary cell lines and results in
selective sensitivity to ganciclovir (117).



Translation is a crucial process in every cell. However, several
alterations in translational control occur in cancer. Cancer cells
appear to need an aberrantly activated translational state for
survival, thus allowing the targeting of translation initiation with
surprisingly low toxicity. Components of the translational
machinery, such as eIF4E, and signal transduction pathways
involved in translation initiation, such as mTOR, represent promising
targets for cancer therapy. Inhibitors of mTOR have already
shown some preliminary activity in clinical trials. It is possible
that with the development of better predictive markers and
better patient selection, response rates to single-agent therapy
can be improved. Similar to other cytostatic agents, however,
mTOR inhibitors are most likely to achieve clinical utility in
combination therapy. In the interim, our increasing
understanding of translation initiation and signal transduction pathways
promises to lead to the identification of new therapeutic targets
in the near future.